Abstract. We construct a compactification of the moduli space of twisted holomorphic
maps with varying complex structure and bounded energy. For a given compact
symplectic manifold $X$ with a compatible complex structure and a Hamiltonian action
of $S^1$ with moment map $\mu : X \to i\mathbb{R}$, the moduli space which we compactify consists of
equivalence classes of tuples $(C, P, A, \phi)$, where $C$ is a smooth compact complex curve
of fixed genus, $P$ is a principal $S^1$ bundle over $C$, $A$ is a connection on $P$ and $\phi$ is a
section of $P \times_{S^1} X$ satisfying

$$\bar\partial_A \phi = 0, \qquad \iota_\nu F_A + \mu(\phi) = c,$$

where $F_A$ is the curvature of $A$, $\nu$ is the restriction to $C$ of a volume form on the
universal curve over $\mathcal{M}_g$ and $c$ is a fixed constant. Two tuples $(C, P, A, \phi)$ and
$(C', P', A', \phi')$ are equivalent if there is a morphism of bundles $\rho : P \to P'$ lifting a
biholomorphism $C \to C'$ such that $\rho^* A' = A$ and $\rho^*\phi' = \phi$. The energy of $(C, P, A, \phi)$ is
$\|F_A\|_{L^2}^2 + \|d_A\phi\|_{L^2}^2 + \|\mu(\phi) - c\|_{L^2}^2$, and the topology of the moduli space is the natural
one. We also incorporate marked points in the picture.

There are two sources of noncompactness. The first is the bubbling off phenomenon, analogous
to the one in Gromov-Witten theory. The second is the degeneration of $C$ to nodal curves. In
this case, there appears a phenomenon which is not present in Gromov-Witten theory: near
the nodes, the section $\phi$ may degenerate to a chain of gradient flow lines of $-i\mu$.
1. Introduction

1.1. Let $C$ be a compact smooth complex curve with a volume form $\nu$, and let $X$ be
a compact symplectic manifold with a compatible almost complex structure. Suppose
that $X$ supports a Hamiltonian action of $S^1$ with moment map $\mu : X \to i\mathbb{R}$. A twisted
holomorphic map (see [M], where the same object was called twisted holomorphic curve)
consists of a principal $S^1$ bundle $P$ over $C$, a connection $A$ on $P$ and a section $\phi$ of the
associated bundle $P \times_{S^1} X$, satisfying the equations

$$\bar\partial_A \phi = 0 \quad\text{and}\quad \iota_\nu F_A + \mu(\phi) = c, \qquad (1.1)$$

where $F_A$ denotes the curvature of $A$, $\nu$ is a volume form on $C$, and $c \in i\mathbb{R}$ is a constant
(see Section 2 for the definition of $\bar\partial_A\phi$). The second equation is called the vortex
equation. Two triples $(P, A, \phi)$ and $(P', A', \phi')$ are said to be equivalent if there is an
isomorphism of bundles $\rho : P \to P'$ which lifts an automorphism of the curve $C$ and
such that $\rho^*A' = A$ and $\rho^*\phi' = \phi$ (note that in [M] we only consider isomorphisms of
bundles lifting the identity, i.e., gauge transformations). The set of isomorphism classes
of triples $(P, A, \phi)$ carries a natural topology, and the resulting topological space $\mathcal{M}$ is
what we call the moduli space of twisted holomorphic maps over $C$. There is a notion
of energy for triples of the form $(P, A, \phi)$, called the Yang-Mills-Higgs functional:

$$\mathcal{YMH}_c(P, A, \phi) := \|F_A\|_{L^2}^2 + \|d_A\phi\|_{L^2}^2 + \|\mu(\phi) - c\|_{L^2}^2. \qquad (1.2)$$

Given a number $K > 0$, define $\mathcal{M}(K)$ to be the subset of $\mathcal{M}$ consisting of isomorphism
classes of triples $(P, A, \phi)$ with energy $< K$. The space $\mathcal{M}(K)$ is not compact,
but the only source of noncompactness is the bubbling off phenomenon well known in
Gromov-Witten theory. Thus, it is not a surprise that essentially the same methods as
in Gromov-Witten theory (combined with some standard techniques in gauge theory)
allow one to construct a compactification of $\mathcal{M}(K)$. This was done in [M], and the resulting
compactification was used to construct what we call the Hamiltonian Gromov-Witten
invariants of $X$, which are closely analogous to Gromov-Witten invariants but depend not
only on a symplectic structure but also on a Hamiltonian action of $S^1$ (both up
to deformation).

1.2. Now, in the general theory of Gromov-Witten invariants one considers maps from
curves $C$ to $X$ representing a given homology class, and only the genus of $C$ is fixed:
its complex structure varies along the moduli space of curves of the given genus. Looking
for a compactification of the resulting moduli space leads to the notion of stable
map introduced by Kontsevich in [Ko], which consists of a compact curve $C$ with nodal
singularities and with some marked points $\mathbf{x}$, and a holomorphic map
$f : C \to X$. The only condition on $(C, \mathbf{x}, f)$ is that any rational component of $C$ on
which $f$ restricts to a trivial map has to have at least 3 exceptional (that is, marked
or singular) points. A definition of Gromov-Witten invariants using moduli spaces of
stable maps was given independently by several authors [FO, LT, S, R] for general compact
symplectic manifolds, extending an earlier definition of Gromov-Witten invariants
by Ruan and Tian [RT1, RT2] for semipositive symplectic manifolds. One of the most
important features of this theory comes from the fact that the moduli space of stable
maps fibres in some sense over the Deligne-Mumford moduli space of stable marked
curves $\overline{\mathcal{M}}_{g,n}$. This brings into play the rich structure in the cohomology
of $\overline{\mathcal{M}}_{g,n}$. A beautiful and simple example which shows how powerful this point of view is
is the proof of the associativity of the small quantum product in terms of the geometry
of $\overline{\mathcal{M}}_{0,4}$. More generally, the gluing formula or composition axiom encodes the structure
in Gromov-Witten invariants inherited from the properties of the cohomology of $\overline{\mathcal{M}}_{g,n}$.

In view of this, it seems desirable to generalize the construction given in [M] replacing
the fixed curve $C$ by arbitrary marked stable curves, constructing in this way a compact
moduli space which fibres over $\overline{\mathcal{M}}_{g,n}$ and obtaining richer invariants. The purpose of
this paper is to make a first step towards this aim: we define stable twisted holomorphic
maps (c-STHM for short), we give a notion of energy of a c-STHM (the Yang-Mills-Higgs
functional), we define a topology on the set of isomorphism classes of c-STHM's, and we
prove that the set of isomorphism classes of c-STHM's with bounded energy is compact.



1.3. The definition of c-STHM involves several new features which do not appear in the
definition of twisted holomorphic maps. The first one is that on nodal curves one has to
consider connections whose pullback to the normalization is not necessarily smooth, but
only meromorphic. This means that the connection does not extend to the preimages of
the nodes, but its curvature extends continuously. To motivate this, consider a sequence
of smooth curves $C_u$ converging to a curve $C$ with a nodal singularity, and choose simple
closed curves $\gamma_u \subset C_u$ representing the vanishing cycle. If we take bundles $P_u$ and
connections $A_u$ on them, it may perfectly well happen that the holonomy of $A_u$ along $\gamma_u$ (which is
an element of $S^1$) converges to an element different from the identity as $u$ goes to $\infty$.
If the curvature of $A_u$ is uniformly bounded on compact sets disjoint from $\gamma_u$, passing
to a subsequence we obtain a limit bundle $P$ and connection $A$ defined on the smooth
locus of $C$, but $A$ does not extend smoothly to the normalization of $C$ because of the
nontrivial holonomy. (The reason why we consider connections whose curvature extends
continuously will be clarified shortly.)

An important notion related to meromorphic connections is that of limit holonomy:
given a meromorphic connection $A$ on the punctured disk $D^*$ with a pole at the origin, one
can define the (limit) holonomy of $A$ around the origin to be the limit of the holonomies
around circles centered at the origin (and oriented in a way compatible with the complex
structure) as the radius converges to 0. This limit exists because the curvature of $A$
extends continuously to $D$ (in fact boundedness is enough). The connection $A$ extends
to a connection on the whole disk $D$ (of type $C^1$) if and only if the holonomy is trivial
(see Corollary 3.2).
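
The notion of limit holonomy can be illustrated numerically. The following is only a sketch under stated assumptions, not a construction from this paper: on the trivial bundle over $D^*$ we take the hypothetical connection form $\alpha = i(l + \kappa r^2/2)\,d\theta$, whose curvature $i\kappa\, r\,dr\wedge d\theta$ is bounded, and we adopt the sign convention that the holonomy around the circle of radius $r$ is $\exp(\oint\alpha)$. The function names and the parameters `l`, `kappa` are illustrative.

```python
import cmath
import math


def holonomy(l, kappa, r):
    """Holonomy around the circle of radius r for the assumed connection
    alpha = i*(l + kappa*r**2/2) dtheta (sign convention: exp of the loop integral)."""
    return cmath.exp(2j * math.pi * (l + kappa * r * r / 2))


def limit_holonomy(l):
    """Limit of holonomy(l, kappa, r) as r -> 0; it does not depend on kappa."""
    return cmath.exp(2j * math.pi * l)


def has_trivial_limit_holonomy(l, tol=1e-12):
    """True when the limit holonomy is the identity of S^1 (up to rounding)."""
    return abs(limit_holonomy(l) - 1) < tol
```

In this model the holonomies differ from their limit by a factor $\exp(i\kappa\pi r^2)$, which is controlled by the curvature bound times the area of the disk; the limit holonomy is trivial exactly when $l$ is an integer.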
Let $y, y'$ be the preimages, in the normalization of $C$, of the singular point $z \in C$.
The holonomy of the connection $A$ around $y$ is equal to the inverse of the holonomy
around $y'$, because both holonomies are obtained as the limits of the holonomies of $A_u$
along $\gamma_u$, one using an orientation and the other using the opposite one.

1.4. Suppose now that we have sections $\phi_u$ of $P_u \times_{S^1} X$ which satisfy equations (1.1)
(this involves a choice of volume form on each curve $C_u$, but for the moment we will
ignore this issue; see Section 5.4). Assume that the norm of the covariant derivatives
$d_{A_u}\phi_u$ is uniformly bounded away from the vanishing cycles. Then the same standard
arguments as in [M] allow one to obtain, passing to a subsequence and regauging, a limit
triple $(P, A, \phi)$ defined on the smooth locus of $C$. Furthermore, one can prove easily that
$\|d_A\phi\|_{L^2}$ is bounded and that $(P, A, \phi)$ satisfies equations (1.1).

An important question to understand about $(P, A, \phi)$ is the asymptotic behavior of
$\phi$ as we approach the singularity of $C$. If the holonomy of $A$ around the singular point is
trivial, then $A$ extends to the normalization of $C$ (because its curvature is uniformly bounded),
and then Gromov's theorem on removal of singularities (as proved for continuous almost
complex structures in [IS]) proves that $\phi$ also extends to the normalization. (At this point elliptic
bootstrapping using (1.1) proves that in fact both $A$ and $\phi$ extend smoothly to the normalization.) To
describe what happens when the holonomy is nontrivial, let us consider the real blow up
of the normalization of $C$ at $y$ and $y'$. This is a real surface whose boundary consists of two circles $S_y$ and
$S_{y'}$, called exceptional divisors, which can be identified with circles centered at 0 in the
tangent spaces of the normalization at $y$ and $y'$. The following is a particular case of Corollary 10.2.

Theorem 1.1. Any triple $(P, A, \phi)$ defined over the smooth locus of $C$ which satisfies

$$\bar\partial_A\phi = 0 \quad\text{and}\quad \|d_A\phi\|_{L^2} < \infty$$

extends to the real blow up. The restriction of the extension of $\phi$ to the exceptional
divisors is covariantly constant, and takes values in the set of points which are fixed by
the action of the limit holonomy of $A$ around $y$.

Let $F \subset X$ be the set of fixed points. We say that a meromorphic connection has
critical holonomy $H$ around a pole if the set of points of $X$ which are fixed by $H$ is
bigger than $F$. The set of critical holonomies forms a finite subset of $S^1$. Theorem 1.1
implies, in particular, that if the holonomy of $A$ around $y'$ is not critical then $\phi$ converges
somewhere in the fixed point set as we approach $z$. When the holonomy of $A$ is a root
of unity (for example, if the holonomy is critical) Theorem 1.1 follows from Gromov's
theorem on removal of singularities, considering local coverings ramified at $y$ and $y'$ so
that the pullback connection has trivial holonomy. In fact, the holonomy of $A$ specifies
in this case an orbifold structure near $y$ and $y'$, and the statement of the theorem is
equivalent to saying that the section $\phi$ extends to the normalisation as a section of
an orbibundle.
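
As an illustration of why the set of critical holonomies is finite, consider an example which is not taken from this paper: $X = \mathbb{CP}^n$ with the linear action $t\cdot[z_0:\dots:z_n] = [t^{w_0}z_0:\dots:t^{w_n}z_n]$ for assumed integer weights $w_j$. An element $e^{2\pi i q}$ fixes a point outside $F$ exactly when $q(w_j - w_k)$ is an integer for some pair of distinct weights, so the critical holonomies are roots of unity. The sketch below lists them; the function name and the weights are hypothetical.

```python
from fractions import Fraction


def critical_holonomies(weights):
    """Return the sorted list of q in [0, 1) such that exp(2*pi*i*q) fixes more
    points of CP^n than the whole circle does, for the assumed linear action
    with the given integer weights."""
    critical = set()
    for a in weights:
        for b in weights:
            d = abs(a - b)
            if d == 0:
                continue  # equal weights impose no condition
            # q * d must be an integer: q = k/d with 0 <= k < d
            for k in range(d):
                critical.add(Fraction(k, d))
    return sorted(critical)
```

For weights `[0, 1, 3]` the weight differences are 1, 2 and 3, so the critical values of $q$ are $0, 1/3, 1/2, 2/3$. Note that the trivial holonomy $q = 0$ is critical whenever the action is nontrivial, which agrees with the treatment of nearly trivial holonomy later in the introduction.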

Applying Theorem 1.1 to the limit triple $(P, A, \phi)$ and recalling that the moment
map $\mu$ is equivariant we deduce that $\mu(\phi)$ extends continuously to the normalization of
$C$. Then the vortex equation implies that $F_A$ also extends continuously, so the limit
connection $A$ is indeed meromorphic.



A COMPACTIFICATION OF THE MODULI SPACE OF TWISTED HOLOMORPHIC MAPS 5 



1.5. In case the norms $|d_{A_u}\phi_u|$ are not bounded one can obtain a meaningful limit object
by adding bubbles (rational components) to the curve $C$. When bubbles are attached
away from the nodes the picture looks exactly like in Gromov-Witten theory (see [M]).
While the first equation in (1.1) is conformally invariant, the second is not, and as we
zoom in on the curve (which is what we do to construct the limit object in the bubbles) the
connection becomes more and more flat. So both $P$ and $A$ are trivial on bubbles away
from nodes, and the only nontrivial object on the bubble is the section $\phi$, which now can
be seen as a holomorphic map from $S^2$ to the fibre of $P \times_{S^1} X$ over the point in $C$ at
which the bubble is attached.

However, if bubbling off occurs near the nodes, new features appear. In the simplest
case the curve $C$ is replaced by the union of the normalisation of $C$ and a rational curve
(the bubble), meeting in two nodes. The limit pair $(P, A)$ need not be trivial on the bubble: although
the connection $A$ will still be flat, it might have poles (nontrivial holonomy) around the
nodes. In this case, the resulting bubble is not quite a rational curve.

To give an example we use the cylindrical model $\mathbb{R} \times S^1$ with coordinates $t, \theta$ for the
rational curve $S^2$ minus two points. Consider the real function $H := -i\mu : X \to \mathbb{R}$.
For any real number $l \neq 0$, take a gradient flow line $\psi : \mathbb{R} \to X$ at speed $l$ for the
function $H$ (so that $\psi$ satisfies the equation $\psi' = l\nabla H(\psi)$). Let $\phi : \mathbb{R} \times S^1 \to X$ be the
map $\phi(t, \theta) := \psi(t)$, and let $\alpha$ be the 1-form $i l\, d\theta$. Then $d_A := d + \alpha$ gives a covariant
derivative on the trivial bundle $P$ over $\mathbb{R} \times S^1$, and we can also regard $\phi$ as a section of the
associated bundle with fibre $X$. By convention, we assume that the complex structure
on $\mathbb{R} \times S^1$ satisfies $\partial/\partial t = i\,\partial/\partial\theta$. With these definitions, $A$ is flat and we have $\bar\partial_A\phi = 0$.
Indeed, the latter equation takes the following form in terms of the coordinates $t, \theta$:

$$\frac{\partial\phi}{\partial t} = I(\phi)\frac{\partial\phi}{\partial\theta} + l\,\nabla H(\phi), \qquad (1.3)$$

which is clearly satisfied. Note also that equation (1.3) is the equation satisfied by
connecting orbits in Floer's complex for the Hamiltonian $lH$ (see [F]).

Taken as a connection over $S^2$ minus two points, $A$ is meromorphic, with holonomy
around the poles equal to $\exp(\pm 2\pi i l)$. Let $x_\pm$ be the limit of $\psi(t)$ as $t$ goes to $\pm\infty$. The
contribution of a bubble to the energy of the limit object is given by the squared $L^2$
norm of $d_A\phi$ on the bubble. In the example which we have constructed this is

$$\|d_A\phi\|_{L^2}^2 = 2\pi l\,(H(x_+) - H(x_-)), \qquad (1.4)$$

so it is finite.
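
The identity behind (1.4) is that along a flow line $\psi' = l\nabla H(\psi)$ one has $\frac{d}{dt}H(\psi) = l|\nabla H(\psi)|^2$, so $\int l^2|\nabla H(\psi)|^2\,dt = l\,(H(x_+) - H(x_-))$. The following sketch checks this numerically in a model case which is an assumption and not part of the paper: $X$ is the unit sphere, $H$ is the height function $z$, so that $|\nabla H|^2 = 1 - z^2$ and the flow line is $z(t) = \tanh(lt)$. It does not check the normalisation factor in (1.4); the function name and parameters are illustrative.

```python
import math


def gradient_line_energy(l, T=20.0, n=20_000):
    """Trapezoid approximation of the integral of l^2 * |grad H|^2 along the
    flow line z(t) = tanh(l*t) on the unit sphere with H = height (assumed model).
    The integration window is |l*t| <= T, outside of which the integrand is negligible."""
    half = T / abs(l)
    h = 2 * half / n
    total = 0.0
    for k in range(n + 1):
        t = -half + k * h
        z = math.tanh(l * t)
        weight = 0.5 if k in (0, n) else 1.0
        total += weight * l * l * (1.0 - z * z)
    return total * h
```

In this model $H(x_+) - H(x_-) = \pm 2$ with the sign of $l$, so the integral equals $2|l|$: it is finite for every $l \neq 0$ and tends to 0 as $l \to 0$.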



1.6. The factor $H(x_+) - H(x_-)$ in formula (1.4) is uniformly bounded by the constant
$\sup_X H - \inf_X H$. Hence, taking $l$ very small, we can obtain bubbles with as small energy
as we wish. This makes a big difference with Gromov-Witten theory, where the energy
of a rational curve cannot be arbitrarily small. This plays an important role in the
compactness theorem in Gromov-Witten theory, where it is used to bound the number
of bubbles in a stable map in terms of the energy.

To get a bound on the number of bubbles in our situation (which is crucial if we want
to have a reasonable compactness theorem for twisted holomorphic maps of bounded
energy), we need to control the geometry of nontrivial bubbles with little energy. This
is done in Theorem 8.4. In fact, the example constructed above gives a hint of what the
general situation is: nontrivial bubbles with little energy can be identified with gradient
lines of $H$, which go upward or downward depending on the holonomy of the connection
(specifically, the holonomy has to be near but not quite critical, and the direction of the
gradient line depends on which side of the nearest critical holonomy the holonomy
of $A$ lies). A particular case of this fact was already proved by Floer in Theorem 5 of
[F] (recall that the equation $\bar\partial_A\phi = 0$ for bubbles is equivalent to equation (1.3) for
connecting orbits in Floer's complex), namely, that in which the critical points of $H$ are
isolated and the indices of the limit points as $t \to \pm\infty$ differ by one. The argument
given by Floer relies on index computations and works only for generic almost complex
structure. Our method of proof is different and is based on an analysis of how much a
bubble with low energy deviates from being a gradient line (see Theorems 11.1 and 11.3;
a similar statement is given in Theorem 1.2 below in this introduction). On the other
hand, in our result we require the Hamiltonian $H$ to generate an action of $S^1$, and from
this point of view Floer's result is more general than ours.

Using the results in Theorem 8.4 we can associate to any chain of nontrivial bubbles 
with little energy a chain of gradient lines, each one going from one critical point to 
another, with as many components as bubbles. Furthermore, since the bubbles are 
consecutive all the holonomies have to be the same, so the chain of gradient lines is 
monotone (either always upward or always downward). Finally, since X is compact, the 
number of components of a monotone chain has to be bounded. 



1.7. Let $\phi'$ be the extension of $\phi$ to the real blow up given by Theorem 1.1. Another
important question is whether, in the absence of blow up, the restrictions of $\phi'$ to each
exceptional divisor $S_y$ and $S_{y'}$ coincide (up to gauge transformation). The answer turns
out to be no: the orbits to which $\phi'(S_y)$ and $\phi'(S_{y'})$ belong need not be the same.
Instead, there is a monotone chain of gradient segments going from one to the other.
Here by gradient segment we mean the image of a map $\xi : T \to X$ satisfying the equation
$\xi' = -\nabla H(\xi)$, where $T \subset \mathbb{R}$ is any closed interval (not necessarily infinite). A chain
of gradient segments is a collection of gradient segments $\xi_1(T_1), \dots, \xi_r(T_r)$ such that for
any $j$ the segments $\xi_j(T_j)$ and $\xi_{j+1}(T_{j+1})$ meet at a unique critical point $f_j \in F$, and the
chain is said to be monotone if, roughly speaking, $H(f_j)$ decreases with $j$.

This makes another difference with Gromov-Witten theory, where the answer to the analogous
question is certainly yes. The key point is that the diameter of the image of a holomorphic
map $f : C \to X$, where $C$ is any long cylinder, can be bounded in terms of the energy
but independently of the length of $C$. This follows from the exponential decay of the
energy density $|df|$ as we go away from the boundary of the cylinder. To understand why
there is such exponential decay, we write the equation $\bar\partial f = 0$ as an evolution equation
$f_t(\tau) = L(\tau)f(\tau)$, where $L(\tau)$ is an elliptic operator on $S^1$ which is skew symmetric up
to compact operators. Then the nonzero spectrum of $L(\tau)$ stays at distance $> \sigma > 0$
from 0, so the function $\|f_t(\tau)\|_{L^2(S^1)}^2$ decays as $e^{-\sigma\tau}$ (note that $|df|^2 = 2|f_t|^2$). This is
something very general (see for example Chapter 3 in [D] for the case of instanton Floer
homology).

If we replace the equation $\bar\partial f = 0$ by $\bar\partial_A\phi = 0$, then the spectra of the operators $L(\tau)$
are shifted by an amount depending on the holonomy of $A$. When the holonomy
approaches a critical value, nonzero eigenvalues of $L(\tau)$ can come arbitrarily close to 0, so one
does not get exponential decay for $|d_A\phi|^2$. The standard way to study the geometry of $\phi$
in such a situation is to break $\phi$ into two pieces $\psi$ and $\phi_0$, in such a way that $\psi$ is spanned
by the small eigenvalues of the operator and $\phi_0$ is spanned by the big ones. Then $\phi_0$
does decay exponentially and so $\psi$ controls the geometry of $\phi$ away from the boundary
of the cylinder. Let us explain in concrete terms how this works in our context.

Consider for simplicity the case in which the holonomy of $A$ is nearly trivial but not
quite. Suppose that $C = [-N, N] \times S^1$ and that $P \to C$ has been trivialized in such
a way that $d_A$ is approximately $d + \lambda\,d\theta$, where $\lambda \in i\mathbb{R} \setminus \{0\}$ is very small. Suppose
that $\phi : C \to X$ has everywhere very little energy $|d_A\phi| < \epsilon$. We can then define
$\psi : [-N, N] \to X$ by setting $\psi(t)$ to be the center of mass of $\phi(t, S^1)$, and $\phi_0 : C \to TX$
by the condition $\phi(t, \theta) = \exp_{\psi(t)} \phi_0(t, \theta)$. The following theorem, which combines parts
of Theorem 11.1 and Theorem 11.3, gives the main properties of $\phi_0$ and $\psi$ when $\phi$ is a
solution of the equations.

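Theorem 1.2. There are constants $K > 0$ and $\sigma > 0$ with the following property.
Suppose that $\nu = f\,dt \wedge d\theta$ is a volume form on $C$ such that $f(t) < \eta e^{-(N - |t|)}$. Assume
that the pair $(A, \phi)$ satisfies the equations

$$\bar\partial_A\phi = 0 \quad\text{and}\quad \iota_\nu F_A + \mu(\phi) = c.$$

Then we have $|\phi_0(t, \theta)| < K e^{-\sigma(N - |t|)}$ and

$$|\psi'(t) + i\lambda\nabla H(\psi(t))| < K e^{-\sigma(N - |t|)}(\epsilon + |\lambda| + \eta)^{1/4}. \qquad (1.5)$$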

The condition on the volume form is natural, since we will consider cylinders like $C$ as
conformal models of curves of the form $\{xy = \delta\} \subset \mathbb{C}^2$ near the origin, with the metric
induced by $\mathbb{C}^2$. The exponential bound on $|\phi_0|$ implies that away from the boundary of
$C$ the function $\phi$ can be approximated by $\psi$, and equation (1.5) tells us that $\psi$ flows
near a gradient line at speed $-i\lambda$.

1.8. To study the behavior of the solutions of (1.1) when $C$ degenerates to a nodal
curve, we take cylinders as conformal models, and both $\epsilon$ and $\eta$ go to 0. Hence, if $\lambda \to 0$
as well, (1.5) implies that the map $\psi$ looks more and more like a gradient line. The
following result, which is Theorem 12.1, makes this fact precise. The actual meaning
of convergence of maps to a chain of gradient segments is given in Section 6.1. A good
approximation of this notion is the convergence of subsets in the Hausdorff metric.

Theorem 1.3. Suppose that $\{\psi_u : T_u \to X, l_u, G_u\}$ is a sequence of triples, where $\psi_u$ is
a smooth map, $T_u \subset \mathbb{R}$ is a finite interval, each $l_u$ is a nonzero real number and each
$G_u > 0$ is a real number. Suppose that for some $\sigma > 0$ and any $u$ and $t \in T_u$ we have

$$|\psi_u'(t) - l_u\nabla H(\psi_u(t))| < G_u e^{-\sigma\, d(t, \partial T_u)}. \qquad (1.6)$$

Suppose also that $G_u \to 0$ and that $l_u \to 0$. Passing to a subsequence, we can assume
that $l_u|T_u|$ converges somewhere in $\mathbb{R} \cup \{\pm\infty\}$ (here $|T_u|$ denotes the length of $T_u$). Then
we have the following.

(1) If $\lim l_u|T_u| = 0$ then $\lim \operatorname{diam} \psi_u(T_u) = 0$.

(2) If $\lim l_u|T_u| \neq 0$, define for big enough $u$ and for every $t \in S_u$ the rescaled objects
$S_u := l_u T_u$ and $f_u(t) := \psi_u(t/l_u)$. There is a subsequence of $\{f_u, S_u\}$ which
converges to a chain of gradient segments $\mathcal{T}$ in $X$.

This theorem is almost evident when the critical points of $H$ (that is, the fixed points
$F$) are isolated. In this case it suffices to pay attention to the complement of a small
neighbourhood of $F$ (where, when $u$ is big enough, $l_u\nabla H$ dominates the error term and
hence the vector field $\psi_u'$ is almost equal to $l_u\nabla H$), and then shrink the neighbourhood
to $F$. The main difficulty appears when the points of $F$ are not isolated and we want
to prove that the limit object is connected: this essentially amounts to proving that the
portion of $f_u(S_u)$ which lies near $F$ has small diameter, which is not at all obvious (near
$F$ the vector field is small, so there is no hope of controlling the error term pointwise using
$\nabla H$; note that we also expect the preimage of a portion of $f_u(S_u)$ near $F$ to become
longer and longer as $u$ goes to infinity, so the path $f_u$ may wander slowly near $F$ but for
a long time). It is in order to prove an estimate on the diameter of the intersection of $f_u(S_u)$
with small neighbourhoods of $F$ (see Lemma 12.3) that we use an exponential bound
on the error term.

1.9. In view of all the preceding observations, it is clear that the objects which we 
should take as limits of solutions of (1.1) over smooth curves degenerating to nodal 
curves have to be of a mixed nature, combining two dimensional objects (holomorphic 
sections) with one dimensional objects (gradient flow lines). A very similar thing happens 
in other related moduli problems. In [CT] the second author and J. Chen construct 
a compactification of the moduli space of harmonic mappings from compact surfaces 
to compact Riemannian manifolds, and the limiting objects are a combination of 2-dimensional
harmonic maps and geodesics (their one dimensional version). Another
instance is the approach suggested by Piunikhin, Salamon and Schwarz [PSS] to proving 
the equivalence of Floer and quantum cohomology rings using spiked disks. 

The actual definition of c-STHM's (see Section 5.5) incorporates two additional fea- 
tures. The first one is that we consider marked points in the curves. Following the 
philosophy of Gromov-Witten theory, we treat marked points and nodes on an equal 
basis, so in particular we allow poles of the meromorphic connections on marked points. 
The other feature is some information, for each node of the curve, on how to define the
bundle in a smoothing of the node (this will be relevant for doing gluing, which will be
addressed in a future paper). We call this information gluing data (see Section 3.4 for the
precise definition). We also give a notion of equivalence between c-STHM's, we define
a topology on the set of isomorphism classes of c-STHM's (see Section 6.5), and define
the Yang-Mills-Higgs functional $\mathcal{YMH}_c$ for c-STHM's (which is essentially like (1.2)).
The main theorem of the paper is then the following (see Theorem 9.1). 
Theorem 1.4. Let $g$ and $n$ be nonnegative integers satisfying $2g + n \geq 3$. Let $K > 0$ be
any number, and let $c \in i\mathbb{R}$. Let $\{\mathcal{C}_u\}$ be a sequence of c-stable twisted holomorphic maps
of genus $g$ and with $n$ marked points, satisfying $\mathcal{YMH}_c(\mathcal{C}_u) < K$ for each $u$. Then there
is a subsequence $\{\mathcal{C}_{u_j}\}$ converging to the isomorphism class of another c-stable twisted
holomorphic map $\mathcal{C}$. Furthermore, we have

$$\lim_{j\to\infty} \mathcal{YMH}_c(\mathcal{C}_{u_j}) = \mathcal{YMH}_c(\mathcal{C}).$$

1.10. Hamiltonian Gromov-Witten invariants. The main application of the results
in this paper is the construction of Hamiltonian Gromov-Witten invariants coupled to
gravity for compact symplectic manifolds, extending the construction given in [M]. This
will appear in [MT], and will be based on the compactness result proved here and the
technique of virtual moduli cycles. The invariants defined in [M] were obtained by
integrating cohomology classes in the moduli space $\mathcal{M}$ of twisted holomorphic maps with
a fixed curve $C$ and representing a given class in $H_2^{S^1}(X)$. In [MT] we will construct the
moduli space $\mathcal{N}_{g,n}$ of c-STHM's of genus $g$ and $n$ marked points and we will define the
Hamiltonian Gromov-Witten invariants coupled to gravity. The main new feature compared
to [M] (beyond removing the semipositivity conditions imposed on the manifold
and the action) is that the moduli space fibres over the Deligne-Mumford moduli space
$\overline{\mathcal{M}}_{g,n}$. This implies that the resulting invariants satisfy a gluing axiom similar to the one
appearing in Gromov-Witten theory.

1.11. Contents. We now briefly summarize the remaining sections of the paper. Sections
2 to 4 are preparatory. In Section 2 we recall the basic objects in the geometry of
Hamiltonian almost Kaehler fibrations (covariant derivatives, d-bar operators and minimal
coupling form). In Section 3 we define meromorphic connections on bundles over
punctured Riemann surfaces, and in Section 4 we recall the definition of orbibundle over 
orbisurfaces. These sections are included in order to fix notations in a way suitable for 
our purposes, and there is essentially nothing new in them. 

Sections 5 to 9 are devoted to defining c-STHM's and stating and proving the main 
theorem on compactness. In Section 5 we define c-STHM's and isomorphisms between 
them, and in Section 6 we define a topology on the set of isomorphism classes of c-STHM's.
Section 7 is devoted to the Yang-Mills-Higgs functional, which plays the role
of energy for c-STHM's. In Section 8 we prove that a bound on the energy of a c-STHM 
imposes a bound on the number of irreducible components of the underlying curve. This 
plays an important role in the compactness theorem, which is the main result of the 
paper and is stated and proved (modulo technical details) in Section 9. 

All technical details of the proof of the compactness theorem are given in Sections 10
to 12. In Section 10 we prove an exponential decay for pairs $(A, \phi)$ satisfying $\bar\partial_A\phi = 0$
and having finite energy on punctured disks. As a corollary we prove Theorem 1.1
of this introduction. We also prove that the energy of pairs on long cylinders decays
exponentially provided the holonomy of the connection stays away from the critical
residues. In Section 11 we study pairs on long cylinders with nearly critical holonomy,
and we prove that in these pairs the section stays near a curve in $X$ which satisfies, up
to normalization and with some error, the equation for gradient lines (this is essentially
Theorem 1.2). Finally, in Section 12 we prove Theorem 1.3.

1.12. Notation. We fix here some notation and conventions which will be used everywhere
in the paper. We denote by $X$ a compact symplectic manifold with symplectic
form $\omega$. The manifold $X$ supports an effective Hamiltonian action of $S^1$ with moment
map $\mu : X \to (\operatorname{Lie} S^1)^*$. We also take an $S^1$-invariant almost complex structure $I$ on $X$,
compatible with the symplectic form $\omega$ in the sense that $g := \omega(\cdot, I\cdot)$ is a Riemannian
metric.

We identify $\operatorname{Lie} S^1 \simeq i\mathbb{R}$ with its dual $(i\mathbb{R})^*$ by using the pairing $\langle a, b\rangle := -ab$ for any
$a, b \in i\mathbb{R}$. Thus in the rest of the paper we will assume that $\mu$ takes values in $i\mathbb{R}$. Let
$\mathcal{X}$ be the vector field on $X$ generated by the infinitesimal action of $i \in i\mathbb{R}$. Thus the
infinitesimal action of any element $\lambda \in i\mathbb{R}$ gives rise to the vector field $-i\lambda\mathcal{X}$. We will
denote $H := -i\mu$. Using our convention, we have $H = \langle\mu, i\rangle$, so that $dH = \langle d\mu, i\rangle = \iota_{\mathcal{X}}\omega$
by the definition of the moment map. It follows that the gradient $\nabla H$ is equal to $I\mathcal{X}$.

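Define, for any pair of points $x, y \in X$, the pseudodistance

$$\operatorname{dist}_{S^1}(x, y) := \inf_{\theta \in S^1} d(x, \theta \cdot y),$$

where $d$ denotes the distance in $X$ defined by the Riemannian metric. Also, if $M \subset X$
is any subset, define

$$\operatorname{diam}_{S^1} M := \sup_{x, y \in M} \operatorname{dist}_{S^1}(x, y).$$

Finally, if $A, B \subset X$ are subsets, we define

$$\operatorname{dist}_{S^1}(A, B) := \inf_{x \in A,\ y \in B} \operatorname{dist}_{S^1}(x, y).$$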
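
To make the pseudodistance concrete, here is a small numerical sketch in a model case which is an assumption and not taken from the paper: $X$ is the unit sphere with its round metric and $S^1$ acts by rotation about the vertical axis. The infimum over $\theta$ is approximated by sampling, and all function names are illustrative.

```python
import math


def sphere_point(lat, lon):
    """Unit vector on the sphere with the given latitude and longitude (radians)."""
    return (math.cos(lat) * math.cos(lon), math.cos(lat) * math.sin(lon), math.sin(lat))


def rotate_z(p, theta):
    """Assumed circle action: rotation by the angle theta about the z-axis."""
    x, y, z = p
    c, s = math.cos(theta), math.sin(theta)
    return (c * x - s * y, s * x + c * y, z)


def geodesic_distance(p, q):
    """Round-metric distance between two unit vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    return math.acos(max(-1.0, min(1.0, dot)))


def dist_s1(p, q, samples=3600):
    """Approximate dist_{S^1}(p, q) = inf over theta of d(p, theta . q) by sampling theta."""
    return min(
        geodesic_distance(p, rotate_z(q, 2 * math.pi * k / samples))
        for k in range(samples)
    )
```

In this model the pseudodistance between two points is the difference of their latitudes, so it vanishes for distinct points of the same orbit; this is why it is only a pseudodistance on $X$.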

Whenever we talk about (marked) curves we will implicitly mean that they are compact
connected complex curves with at most nodal singularities and possibly with some
marked points. These will be a list of smooth labelled points $x_1, \dots, x_n$, and we will
often denote it by a boldface $\mathbf{x}$. When we write $x \in \mathbf{x}$ we will mean that $x$ is one of the
points of the list.

Many of the constants appearing in the estimates will be denoted by the same symbol 
K, and often the value of K will change from line to line. 

2. Almost Kaehler geometry of Hamiltonian $S^1$ fibrations

Let $C$ be a (not necessarily compact) complex curve and let $P$ be a principal $S^1$
bundle over $C$. Denote by $Y$ the twisted product $P \times_{S^1} X$ and by $\pi : Y \to C$ the natural
projection. Let $T^{\mathrm{vert}}$ be the vertical tangent bundle $\operatorname{Ker} d\pi \subset TY$. Since the almost
complex structure $I$ is $S^1$-invariant, it defines a complex structure on each fibre of $T^{\mathrm{vert}}$,
which we still denote by $I$. We also denote by $g$ the Euclidean form on $T^{\mathrm{vert}}$ defined
by the $S^1$-invariant metric $g$ on $X$. Let $I_C$ be the complex structure on $C$ (so $I_C$ is an
endomorphism of the real tangent bundle of $C$), and let $g_C$ be a conformal metric on $C$.
Any connection $A$ on $P$ induces a splitting $TY \simeq \pi^*TC \oplus T^{\mathrm{vert}}$, which we use to define
$I(A)$ (resp. $g(A)$) as the sum of $\pi^*I_C$ and $I$ (resp. $\pi^*g_C$ and $g$).

Suppose that $\phi$ is a section of $Y$. We define the covariant derivative $d_A\phi$ of $\phi$ with
respect to the connection $A$ to be the composition of $d\phi$ with the projection $\pi_A : TY \to
T^{\mathrm{vert}}$ induced by $A$. Hence, $d_A\phi$ is a one form on $C$ with values in the pullback $\phi^*T^{\mathrm{vert}}$.

Lemma 2.1. Let $S$ be the interval $[0, 1]$, let $\gamma : S \to C$ be a smooth map, and let $\phi$ be a
section of $Y$. Then $\operatorname{dist}_{S^1}(\phi\gamma(1), \phi\gamma(0))$ is at most the integral over $S$ of $|\langle d_A\phi(\gamma), \gamma'\rangle|$.

Proof. Take a trivialization $\gamma^*Y \simeq [0, 1] \times X$, and consider a gauge transformation $g :
[0, 1] \to S^1$ which sends $\gamma^*A$ to the trivial connection. Then we have $\operatorname{dist}_{S^1}(\phi\gamma(1), \phi\gamma(0)) =
\operatorname{dist}_{S^1}(g\phi\gamma(1), g\phi\gamma(0))$, and the latter can easily be estimated in terms of the integral of
$|\langle d_A\phi(\gamma), \gamma'\rangle|$ (note that by covariance we have $\langle d_A\phi(\gamma), \gamma'\rangle = \langle d_{g^*\gamma^*A}(g\phi), \partial/\partial t\rangle$). □

We can split the space of forms $\Omega^1(C, \phi^*T^{vert})$ as the sum of the space of holomorphic
forms $\Omega^{1,0}(C, \phi^*T^{vert})$ plus the space of antiholomorphic forms $\Omega^{0,1}(C, \phi^*T^{vert})$. Let $\bar\partial_{I,A}\phi$
be the projection of $d_A\phi$ to $\Omega^{0,1}(C, \phi^*T^{vert})$. In concrete terms,

$$\bar\partial_{I,A}\phi := \tfrac{1}{2}\,(d_A\phi + I \circ d_A\phi \circ I_C). \qquad (2.7)$$
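
As a quick sanity check on (2.7), read as $\bar\partial_{I,A}\phi = \tfrac12(d_A\phi + I\circ d_A\phi\circ I_C)$, one can verify directly that this expression is complex antilinear, as a form of type $(0,1)$ should be. The computation below is only a sketch; it uses nothing beyond $I^2 = -1$ and $I_C^2 = -1$.

```latex
\begin{align*}
\bar\partial_{I,A}\phi\circ I_C
 &= \tfrac12\bigl(d_A\phi\circ I_C + I\circ d_A\phi\circ I_C^2\bigr)
  = \tfrac12\bigl(d_A\phi\circ I_C - I\circ d_A\phi\bigr),\\
-I\circ\bar\partial_{I,A}\phi
 &= \tfrac12\bigl(-I\circ d_A\phi - I^2\circ d_A\phi\circ I_C\bigr)
  = \tfrac12\bigl(d_A\phi\circ I_C - I\circ d_A\phi\bigr).
\end{align*}
```

Hence $\bar\partial_{I,A}\phi\circ I_C = -I\circ\bar\partial_{I,A}\phi$, and $\bar\partial_{I,A}\phi = 0$ holds exactly when $d_A\phi\circ I_C = I\circ d_A\phi$, that is, when $d_A\phi$ is complex linear.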

(When the complex structure on $X$ is clear from the context, we will simply write
$\bar\partial_A\phi$.) Let us denote by $\Phi : C \to Y$ the map defined by the section $\phi$. It is straightforward
to check that

$$\bar\partial_{I,A}\phi = 0 \iff \bar\partial_{I(A)}\Phi = 0. \qquad (2.8)$$

On the other hand, for any section $\phi$ of $Y$ we have

$$|d\Phi|^2 = |d\,\mathrm{Id}_C|^2 + |d_A\phi|^2 = 1 + |d_A\phi|^2. \qquad (2.9)$$

Given a connection $A$ on $P$, there is a canonical way to pick a closed 2-form $\omega(A)$ on
$Y$ which restricts to $\omega$ on each fibre. This is the minimal coupling form (see [GLS]).
To give a local description of it we can assume that there is a trivialisation $P \simeq C \times S^1$.
Let $\alpha \in \Omega^1(C, i\mathbb{R})$ be the 1-form corresponding to $A$ with respect to this trivialisation.
Let also $\pi_X : Y \to X$ be the projection induced by the trivialisation. Then we have

$$\omega(A) = \pi_X^*\omega - d(\pi^*\alpha \wedge \pi_X^*\mu) = \pi_X^*\omega - \pi^*\alpha \wedge \pi_X^*\iota_{\mathcal{X}}\omega - \pi^*F_A \wedge \pi_X^*\mu, \qquad (2.10)$$

where $F_A$ is the curvature of $A$. If $A$ is flat, then $\omega(A)$ coincides with the 2-form
$g(A)(\cdot, -I(A)\cdot)$ when restricted to vertical tangent vectors.

Remark 2.2. The cohomology class represented by $\omega(A)$ is independent of $A$, and can
in fact be identified with the pullback of the class $[\omega - \mu t]$ in equivariant cohomology (this
denotes the class represented by the element $\omega - \mu t$ of the Cartan-Weil complex $\mathbb{R}[t] \otimes
\Omega(X)^{S^1}$, see for example [GS]). More precisely, if $c : C \to BS^1$ is the classifying map
for $P$, then we can identify $Y \simeq c^*X_{S^1}$ and $[\omega(A)]$ is equal to $c^*[\omega - \mu t]$.

3. Meromorphic connections on marked nodal surfaces

3.1. Meromorphic connections on marked smooth curves. Let $(C, \mathbf{x})$ be a marked
smooth complex curve, and let $P$ be a principal $S^1$ bundle over $C \setminus \mathbf{x}$. For any $x \in \mathbf{x}$
we denote by $T(P, x)$ the set of trivialisations up to homotopy of the restriction of $P$ to
a small loop $\gamma$ around $x$ (since for two different loops $\gamma$ and $\gamma'$ the sets of trivialisations
of the corresponding restrictions can be canonically identified, $T(P, x)$ is independent
of the chosen loop). If we orient $\gamma$ counterclockwise with respect to the natural orientation
of $C$ as a Riemann surface, then $T(P, x)$ gets a natural structure of $\mathbb{Z}$-torsor$^1$, given by the
action of gauge transformations defined over $\gamma$: if $t : P|_\gamma \to \gamma \times S^1$ is a trivialisation and
$\tau := [t]$ denotes its class in $T(P, x)$, then for any $n \in \mathbb{Z}$ we define $n \cdot \tau$ to be the class
of the trivialisation $t \circ g$, where $g : P|_\gamma \to P|_\gamma$ is the gauge transformation given by any
map $\gamma \to S^1$ of index $n$.

Any $\tau \in T(P, x)$ defines an extension of $P$ to $x$: if $D \subset C$ is a disk containing $x$ and
having $\gamma$ as boundary, then we glue $P$ to the trivial bundle over $D$ using any extension
of a representative of $\tau$ to $D \setminus \{x\}$ as patching function. This extension is well defined
up to homotopy. In general, if we pick a collection of local trivialisations

$$\tau \in T(P, \mathbf{x}) := \prod_{x \in \mathbf{x}} T(P, x)$$

then we get an extension of $P$ to $C$, which we denote by $P^\tau$. We define the degree of
the bundle $P$ over $(C, \mathbf{x})$ to be the map

$$\deg P : T(P, \mathbf{x}) \to \mathbb{Z}$$

which sends any $\tau \in T(P, \mathbf{x})$ to $\deg P(\tau) := \deg P^\tau$.

We say that a connection $A$ on $P$ is meromorphic with poles in $\mathbf{x}$ if its curvature $F_A$
extends to the whole $C$ as a continuous 2-form.

Suppose that $A$ is meromorphic. Choose any metric on $C$, and fix some $x \in \mathbf{x}$. Let $\gamma_\epsilon$
denote the boundary of a geodesic disk centered on $x$ and of radius $\epsilon$, oriented
counterclockwise. Denote by $\mathrm{Hol}(A, \gamma_\epsilon) \in S^1$ the holonomy of $A$ around $\gamma_\epsilon$. We claim that the
limit

$$\mathrm{Hol}(A, x) := \lim_{\epsilon \to 0} \mathrm{Hol}(A, \gamma_\epsilon) \qquad (3.11)$$

exists. Indeed, if $\epsilon > \epsilon' > 0$ then the quotient $\mathrm{Hol}(A, \gamma_\epsilon)/\mathrm{Hol}(A, \gamma_{\epsilon'})$ is equal, by Stokes'
theorem, to the exponential of the integral of the curvature $F_A$ over the region enclosed
between $\gamma_\epsilon$ and $\gamma_{\epsilon'}$. But since $F_A$ extends continuously to $x$, this integral tends to 0 as
$\epsilon \to 0$, so the quotient of the holonomies tends to 1. The same argument proves that the
definition of $\mathrm{Hol}(A, x)$ is independent of the chosen metric.
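
The Stokes argument above can be made quantitative. The following sketch assumes, for simplicity, that the metric is Euclidean near $x$; the names $R_{\epsilon,\epsilon'}$ (the annulus between the two loops) and $D_\epsilon$ (the disk of radius $\epsilon$) are introduced here only for illustration, and the sign in the exponent depends on the orientation and holonomy conventions.

```latex
\[
\frac{\mathrm{Hol}(A,\gamma_\epsilon)}{\mathrm{Hol}(A,\gamma_{\epsilon'})}
 = \exp\Bigl(\pm\int_{R_{\epsilon,\epsilon'}} F_A\Bigr),
\qquad
\Bigl|\int_{R_{\epsilon,\epsilon'}} F_A\Bigr|
 \le \pi(\epsilon^2-\epsilon'^2)\,\sup_{D_\epsilon}|F_A|
 \le \pi\epsilon^2\,\sup_{D_\epsilon}|F_A| .
\]
```

Since $F_A$ is continuous at $x$, the supremum stays bounded as $\epsilon \to 0$, so the right hand side tends to 0 and the holonomies $\mathrm{Hol}(A,\gamma_\epsilon)$ form a Cauchy family in $S^1$.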

Lemma 3.1. Let $(r, \theta)$ be polar coordinates defined on a small disk $D$ centered on $x \in \mathbf{x}$.
Any $\tau \in T(P, x)$ has a representative which extends to give a trivialisation of $P$ on
$D \setminus \{x\}$ with respect to which the covariant derivative associated to $A$ can be written as

$$d_A = d + \alpha + \lambda\, d\theta, \qquad (3.12)$$

where $\alpha \in \Gamma(T^*D \otimes i\mathbb{R})$ is a 1-form of type $C^1$, and $\lambda \in i\mathbb{R}$ satisfies $\mathrm{Hol}(A, x) = e^{2\pi\lambda}$.
Furthermore, any two choices of trivialisations associated to the same $\tau \in T(P, x)$ are
related by a gauge transformation which extends to $D$.

$^1$ Recall that if $\Gamma$ is a group, a $\Gamma$-torsor is a set $T$ with a free and transitive left action of $\Gamma$.

Proof. Take a trivialisation of $P$ on $D \setminus \{x\}$ extending any representative of $\tau$. With
respect to this trivialisation we can write $d_A = d + \alpha_0$, so that $F_A = d\alpha_0$. Let

$$\lambda := \frac{1}{2\pi} \lim_{\epsilon \to 0} \int_{\gamma_\epsilon} \alpha_0 .$$

That this limit exists follows as in the definition of $\mathrm{Hol}(A, x)$ from Stokes' theorem,
writing the difference of the integrals on the right hand side for two choices of $\epsilon$ as the
integral of $d\alpha_0$ on the corresponding annulus, and then using the fact that $d\alpha_0$ extends
continuously to $D$. By Poincaré's lemma, there exists $\alpha \in \Omega^1(D, i\mathbb{R})$ of type $C^1$ such
that $d\alpha = d\alpha_0$. Hence $\beta := \alpha_0 - \alpha - \lambda\, d\theta$ is closed. But it is also exact, since for any
$\epsilon > 0$ we have

$$\int_{\gamma_\epsilon} \beta = \lim_{\epsilon' \to 0} \int_{\gamma_{\epsilon'}} \beta = \lim_{\epsilon' \to 0} \int_{\gamma_{\epsilon'}} (\alpha_0 - \lambda\, d\theta) = 0$$

(in the first equality we use that $\beta$ is closed, in the second one that $\alpha$ extends to $D$
and in the third one the definition of $\lambda$). Consequently, we can write $\beta = dg$ for some
$g : D \setminus \{x\} \to i\mathbb{R}$. Now, the gauge transformation $G := \exp g$ transforms $d + \alpha_0$ into
$d + \alpha + \lambda\, d\theta$. If another gauge transformation $G'$ with winding number 0 transforms $d + \alpha_0$
into $d + \alpha' + \lambda\, d\theta$ and $\alpha'$ extends to $D$, then we can write $G' = \exp g'$ (because the winding
number of $G'$ is 0) and $dg - dg' = \alpha - \alpha'$. The right hand side is a closed 1-form which
extends to $D$, hence $g - g'$ extends to $D$. □

Corollary 3.2. Let $P$ be a bundle over $C \setminus \mathbf{x}$ and let $A$ be a meromorphic connection on
$P$. Let $\mathbf{x}' \subset \mathbf{x}$ be the marked points around which the holonomy of $A$ is nontrivial. Then
there is a bundle $P' \to C \setminus \mathbf{x}'$ with a smooth connection $A'$ such that $\iota^*(P', A') \simeq (P, A)$,
where $\iota : C \setminus \mathbf{x} \to C \setminus \mathbf{x}'$ denotes the inclusion.

Remark 3.3. In fact, it is even possible to trivialise $P$ around $x$ in such a way that $d_A$
is in radial gauge, that is, it takes the form $d + \alpha + \lambda\, d\theta$ and $\alpha = \alpha_\theta\, d\theta$ is of type $C^1$
(in particular, $\alpha_\theta$ vanishes at 0). Indeed, given a covariant derivative $d + \alpha + \lambda\, d\theta$ as in
Lemma 3.1 on the trivial bundle over the punctured disk, we can define $g : D \to S^1$ to
be the map whose value at $z \in D$ gives the parallel transport with respect to $d + \alpha$ from
the fibre over $z$ to that over 0. Then $d + \alpha - g^{-1}dg$ is in radial gauge. Restricting to the
punctured disk, the gauge transformation $g$ sends $d + \alpha + \lambda\, d\theta$ to a connection in radial
gauge.

We call $\lambda =: \mathrm{Res}(A, x, \tau) := \mathrm{Res}(A, x, \tau_x)$ the residue of $A$ on $x$ with respect to $\tau$. It
is straightforward to check that for any $n \in \mathbb{Z}$ we have

$$\mathrm{Res}(A, x, n \cdot \tau) = in + \mathrm{Res}(A, x, \tau). \qquad (3.13)$$

Remark 3.4. One can check easily that if $A$ is a connection whose curvature $F_A$ is
uniformly bounded (but does not necessarily extend continuously to $C$) then Lemma 3.1 is still
true, with the difference that the 1-form $\alpha$ is in general only continuous. Consequently,
one can also define in this situation the residue $\mathrm{Res}(A, x, \tau)$ and formula (3.13) still
holds.
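
Formula (3.13) is consistent with the fact that the holonomy does not depend on the chosen trivialisation: shifting the residue by $in$ leaves $e^{2\pi\lambda}$ unchanged. Written out, with $\lambda = \mathrm{Res}(A, x, \tau)$ and $n \in \mathbb{Z}$:

```latex
\[
e^{2\pi\,\mathrm{Res}(A,x,n\cdot\tau)}
 = e^{2\pi(\lambda+in)}
 = e^{2\pi\lambda}\,e^{2\pi i n}
 = e^{2\pi\lambda}
 = \mathrm{Hol}(A,x).
\]
```

In other words, the holonomy determines the residue only modulo $i\mathbb{Z}$, and the choice of $\tau \in T(P, x)$ selects one lift.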

3.2. Meromorphic connections on marked nodal curves. Now suppose that $(C, \mathbf{x})$
is a marked curve with nodal singularities. Denote by $\mathbf{z}$ the set of nodes of $C$. Let
$\pi : C' \to C$ be the normalisation map. We denote the preimage by $\pi$ of the marked
points $\mathbf{x}$ with the same symbol $\mathbf{x}$.

If $P$ is a bundle over $C \setminus (\mathbf{x} \cup \mathbf{z})$, by a meromorphic connection on $P$ we mean a
meromorphic connection on $\pi^*P$ such that for every node $z \in \mathbf{z}$, denoting by $y, y'$ the
preimages of $z$ in the normalisation, we have

$$\mathrm{Hol}(A, y) = \mathrm{Hol}(A, y')^{-1}.$$

Define$^2$ for any such node $T(P, z) := T(\pi^*P, y) \times_{\mathbb{Z}} T(\pi^*P, y')$. A marked $S^1$ principal
bundle over $(C, \mathbf{x})$ is a pair $(P, \sigma)$, where $P$ is a principal $S^1$ bundle over $C \setminus (\mathbf{x} \cup \mathbf{z})$ and

$$\sigma \in T(P, \mathbf{z}) := \prod_{z \in \mathbf{z}} T(P, z).$$

Let $C_s$ be a smoothing of $C$ (hence, $C_s$ is isomorphic to $C$ away from a collection of
neighbourhoods of the nodes of $C$, and near the nodes $C_s$ takes the form $\{xy = \epsilon\} \subset \mathbb{C}^2$
instead of $\{xy = 0\} \subset \mathbb{C}^2$). The elements in $\sigma$ give a unique way (up to homotopy) of
extending $P$ to a bundle $P_s$ over $C_s \setminus \mathbf{x}$ (here we identify $\mathbf{x}$ with points in $C_s$). Then we
define the degree of $(P, \sigma)$ to be the map

$$\deg P^\sigma : T(P, \mathbf{x}) \to \mathbb{Z}$$

given by $\deg P^\sigma := \deg P_s$.

A meromorphic connection on $(P, \sigma)$ is a meromorphic connection $A$ on $\pi^*P$ with
poles in $\mathbf{x} \cup \mathbf{y}$ which satisfies the following compatibility condition on each node $z$: if $y, y'$
are the preimages of $z$ in $C'$ and we take a representative $(\xi, \xi') \in T(\pi^*P, y) \times T(\pi^*P, y')$
of $\sigma_z$ then

$$\mathrm{Res}(A, y, \xi) + \mathrm{Res}(A, y', \xi') = 0. \qquad (3.14)$$

In other words, the sum of the residues in both preimages of the node has to vanish.
That this condition is well defined follows from (3.13).
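
To spell out why (3.13) makes condition (3.14) well defined: two representatives of the same class in $T(\pi^*P, y) \times_{\mathbb{Z}} T(\pi^*P, y')$ differ by moving an integer $n$ from one factor to the other, and the sum of the residues is the same for both. A short check, using only (3.13):

```latex
\begin{align*}
\mathrm{Res}(A,y,n\cdot\xi)+\mathrm{Res}(A,y',\xi')
 &= in+\mathrm{Res}(A,y,\xi)+\mathrm{Res}(A,y',\xi')\\
 &= \mathrm{Res}(A,y,\xi)+\mathrm{Res}(A,y',n\cdot\xi').
\end{align*}
```

Hence the left hand side of (3.14) depends only on the class of $(\xi, \xi')$, not on the representative.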

It is straightforward to check that given a connection $A$ on $P$ there is a unique choice
of $\sigma$ with respect to which $A$ is meromorphic (that is, formula (3.14) holds). Using such
$\sigma$, we define

$$\deg P^A := \deg P^\sigma.$$

$^2$ If $T$ and $T'$ are two $\Gamma$-torsors and $\Gamma$ is abelian, we denote $T \times_\Gamma T' := T \times T'/\sim$, where the equivalence
relation $\sim$ identifies, for any $(t, t') \in T \times T'$ and $g \in \Gamma$, $(g \cdot t, t') \sim (t, g \cdot t')$.

3.3. Chern-Weil formula. The following lemma gives a generalisation of the simplest
Chern-Weil formula to the case of meromorphic connections on nodal curves.

Lemma 3.5. Denote by $F_A$ the curvature of a meromorphic connection $A$ on $P$. For
any $\tau \in T(P, \mathbf{x})$ we have

$$\deg P^A(\tau) = \frac{i}{2\pi} \int_C F_A + i \sum_{x \in \mathbf{x}} \mathrm{Res}(A, x, \tau). \qquad (3.15)$$



Proof. It is enough to consider the case of smooth $C$ (in the nodal case, pulling back to
the normalisation we are led to the smooth case). Hence we assume that $C$ is smooth
and that $A$ is a meromorphic connection on a bundle $P$ over $C \setminus \mathbf{x}$. Let $\mathbf{x} = (x_1, \ldots, x_n)$.
Pick some $\tau \in T(P, \mathbf{x})$ (that is, a homotopy class of trivialisations of the restriction of $P$
to a small loop around each marked point) and
define $\lambda_j := \mathrm{Res}(A, x_j, \tau)$. Take disjoint disks $D_1, \ldots, D_n$ centered at the marked points,
small enough so that we can apply Lemma 3.1 to get trivialisations of $P$ near each $x_j$
for which the covariant derivative of $A$ takes the form $d + \alpha_j + \lambda_j\, d\theta$ (here $(r, \theta)$ are polar
coordinates in $D_j$). For any $j$ choose a smooth 1-form $\eta_j$ on $D_j$ which coincides with
$d\theta$ away from a small disk $\{r < \epsilon\} \subset D_j$. Now, modify $A$ near each $x_j$ by replacing
$d + \alpha_j + \lambda_j\, d\theta$ by $d + \alpha_j + \lambda_j \eta_j$. In this way we get a smooth connection $A'$ on the bundle
$P^\tau$ over $C$. Then using standard Chern-Weil and Stokes we compute

$$\deg P(\tau) = \deg P^\tau = \frac{i}{2\pi} \int_C F_{A'} = \frac{i}{2\pi} \int_C F_A + \frac{i}{2\pi} \sum_j \lambda_j \int_{D_j} d\eta_j .$$

□
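
The last step of this computation can be made explicit. The sketch below assumes, as in the proof, that $\eta_j$ is smooth on $D_j$ and equals $d\theta$ near $\partial D_j$, so that $F_{A'} = F_A + \lambda_j\, d\eta_j$ on $D_j$ (recall that $d(d\theta) = 0$ away from $x_j$).

```latex
\[
\int_{D_j} d\eta_j = \int_{\partial D_j}\eta_j = \int_{\partial D_j} d\theta = 2\pi,
\qquad\text{hence}\qquad
\frac{i}{2\pi}\sum_j \lambda_j\int_{D_j} d\eta_j
 = i\sum_j \lambda_j
 = i\sum_{x\in\mathbf{x}}\mathrm{Res}(A,x,\tau).
\]
```

This recovers the right hand side of (3.15). As a consistency check, replacing $\tau_x$ by $n \cdot \tau_x$ changes the residue by $in$ according to (3.13), so the right hand side of (3.15) changes by $i \cdot in = -n$, an integer, as a degree must.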

3.4. Gluing data. Suppose that $(C, \mathbf{x})$ is a smooth curve, $P$ is a principal $S^1$ bundle
over $C \setminus \mathbf{x}$ and $A$ is a meromorphic connection on $P$. For every $x \in \mathbf{x}$, let $S_x$ be the
quotient $T_xC^*/\mathbb{R}_{>0}$, where $T_xC^* \subset T_xC$ denotes the nonzero elements and $\mathbb{R}_{>0}$ acts on
$T_xC^*$ by multiplication. The conformal structure on $C$ induces an orientation and a
metric on $S_x$ up to constant scalar. Imposing the volume of $S_x$ to be $2\pi$, we get a well
defined metric on $S_x$. A useful point of view is to look at $S_x$ as the exceptional divisor
of the real blowup of $C$ at $x$.

The pair $(P, A)$ induces a limiting pair $(P_x, A_x)$, where $P_x$ is a bundle over $S_x$ and
$A_x$ is a connection on $P_x$, in the following way. Pick any conformal metric on $C$ and take
a small $\epsilon > 0$. Let $D_\epsilon := \exp_x\{v \mid v \in T_xC^*, |v| < \epsilon\}$. Let $D_\epsilon/\mathbb{R}_{>0}$ denote the quotient
by the equivalence relation which identifies $y, z$ if and only if $y = \exp_x u$, $z = \exp_x v$
and $u \in \mathbb{R}_{>0} v$. We can lift this equivalence to the restriction of $P$ on $D_\epsilon$ by using parallel
transport (with respect to $A$) along lines of the form $\{\exp_x \lambda u \mid \lambda \in \mathbb{R}_{>0}\}$, and we denote
the quotient by $P_x := P|_{D_\epsilon}/\mathbb{R}_{>0}$. We look at $P_x$ as a bundle over $S_x$ using the obvious
identification $S_x \simeq D_\epsilon/\mathbb{R}_{>0}$. To define the limiting connection we proceed as follows: for
any $\delta > 0$ smaller than $\epsilon$, let $S_\delta$ be the exponential of the circle of radius $\delta$ in $T_xC$.
The composition of inclusion and quotient $S_\delta \to D_\epsilon \to D_\epsilon/\mathbb{R}_{>0} \simeq S_x$ is an isomorphism,
and we denote by $g_\delta$ its inverse. Using parallel transport, $g_\delta$ lifts to an isomorphism
$P_x \simeq g_\delta^* P|_{S_\delta}$. One checks, using the fact that $A$ is meromorphic, that the pullback
connections $g_\delta^* A$ converge to a limit connection, which we denote by $A_x$. (For example,
use a trivialisation around $x$ which puts $d_A$ in radial gauge, so that $d_A = d + \alpha + \lambda\, d\theta$,
where $\alpha$ extends to $x$; see Remark 3.3.) The resulting pair $(P_x, A_x)$ is independent of
the chosen metric on $C$. Of course, the holonomy of $A_x$ around $S_x$ coincides with the
holonomy $\mathrm{Hol}(A, x)$ of $A$ around $x$ as defined by (3.11).

In other words, the pullback of $(P, A)$ to the real blowup of $C$ at $x$ extends to the
exceptional divisor $S_x$, and the restriction to $S_x$ of the extension is isomorphic to $(P_x, A_x)$.

Now suppose that $(C, \mathbf{x})$ is a nodal curve. Let $z \in C$ be a node and let $y, y'$ be its
preimages in the normalisation $C'$ of $C$. Define the set of gluing angles at $z$ to be

$$\Gamma_z := (T_yC' \otimes T_{y'}C')^*/\mathbb{R}_{>0}.$$

Recall that the nonzero elements of $T_yC' \otimes T_{y'}C'$ specify deformations of $C$ which
smoothen the singularity at $z$ (see Section 6.3). On the other hand, the set $\Gamma_z$ can
be identified with the set of isometries $\gamma : S_y \to S_{y'}$ which reverse the orientations.

Let $P$ be a principal $S^1$ bundle over $C \setminus (\mathbf{x} \cup \mathbf{z})$, where $\mathbf{z}$ denotes the set of nodes.
We define the set of gluing data for $P$ at $z$ to be the set $\Gamma(P, z)$ of pairs $(\gamma, \rho)$,
where $\gamma \in \Gamma_z$ is a gluing angle at $z$ and $\rho : \gamma^*P_{y'} \to P_y$ is an isomorphism of bundles
satisfying $\rho^*A_{y'} = A_y$. Clearly, since $\gamma$ reverses the orientation, the condition for $\rho$ to
exist is that the holonomy of $A_{y'}$ is the inverse of that of $A_y$. Note that the projection
$\pi_z : \Gamma(P, z) \to \Gamma_z$ has a natural structure of principal $S^1$ bundle. Finally, we define the
set of gluing data for $P$ to be the product

$$\Gamma(P) := \prod_{z \in \mathbf{z}} \Gamma(P, z)$$

of sets of gluing data at each of the nodes of $C$. The group of gauge transformations of
$P$ acts in an obvious way on $\Gamma(P)$ preserving the gluing angles (in other words, for every
node $z$ the induced action on $\Gamma(P, z)$ preserves the map $\pi_z$).

Any diagram of the form

$$\begin{array}{ccc} P & \xrightarrow{\;g\;} & P' \\ \downarrow & & \downarrow \\ C & \xrightarrow{\;f\;} & C' \end{array}$$

where $g$ is an isomorphism of principal $S^1$ bundles and $f$ is a biholomorphism satisfying
$f(\mathbf{x}) = \mathbf{x}'$ induces a morphism of gluing data $g^* : \Gamma(P') \to \Gamma(P)$ (note that the
vertical arrows in the diagram are not surjections, since the bundles are defined over
the set of smooth points of $C$ and $C'$ which are not marked).

A choice of gluing data for $P$ should be understood as a device to specify deformations
of $P$ lifting certain smoothings of $C$. More precisely, suppose that $G \in \Gamma(P)$ consists of
gluing data $(\gamma_1, \rho_1), \ldots, (\gamma_k, \rho_k)$. The set of gluing angles $(\gamma_1, \ldots, \gamma_k)$ gives a real subspace
$T$ in the tangent space of $\overline{\mathcal{M}}_{g,n}$ at the point represented by $(C, \mathbf{x})$ (suppose for simplicity
that we only smoothen the singularities and do not modify the complex structures of
the irreducible components of $C$). Then, for each deformation $C_t$ parametrized by some
small segment $t \in [0, \epsilon)$ tangent to $T$, $G$ gives a way to extend $(P, A)$ along $C_t$, in an
infinitesimally unique way.

This may seem a little unsatisfactory, since it would be preferable to specify some
data which tells how to deform $(P, A)$ along any deformation of $(C, \mathbf{x})$ in $\overline{\mathcal{M}}_{g,n}$. Such
a thing is certainly possible if $A$ has trivial holonomy around the preimages of $z$. In
this case, the bundle $\pi^*P$ (recall that $\pi$ is the normalisation map) extends over $y$ and
$y'$ (essentially by Lemma 3.1) to give a bundle $P'$, and picking an identification of the
fibres $P'_y$ and $P'_{y'}$ (which should not be confused with $P_y$ and $P_{y'}$) gives a way to extend
$P$ to any smoothing of $C$ at $z$. Furthermore, the set of possible identifications between
$P'_y$ and $P'_{y'}$ is a torsor over $S^1$.

In view of this we could wonder whether it is possible to choose, when $A$ has nontrivial
holonomy $H$ around $y$ and $y'$, a family parametrized by $S^1$ of sections of the bundle
$\Gamma(P, z) \to \Gamma_z$, varying continuously with $H$ (the point is that such a section would give
a way to deform $P$ for each direction of smoothings of $C$). But this is unfortunately not
possible, as we now explain. Denote by $\mathcal{H}$, $\mathcal{H}'$ the moduli spaces of connections on the
trivial $S^1$ bundle over $S_y$, $S_{y'}$. We then have $\mathcal{H} \simeq S^1 \simeq \mathcal{H}'$ canonically, the isomorphism
being given by the holonomy. There are universal Poincaré bundles $\mathcal{P} \to S_y \times \mathcal{H}$ and
$\mathcal{P}' \to S_{y'} \times \mathcal{H}'$ (with a universal connection $\mathbb{A}$ on $\mathcal{P}$ whose restriction to the fibre $S_y \times \{H\}$
has holonomy $H \in S^1$, and similarly there is a connection $\mathbb{A}'$ on $\mathcal{P}'$). Consider the
bundle $\mathcal{I} := \mathrm{Isom}(\mathcal{P}, \mathcal{P}')$ over $\Gamma_z \times S^1$ whose fibre on $(\gamma, H)$ is the set of isomorphisms
between $\mathcal{P}|_{S_y \times \{H\}}$ and $\eta^*\mathcal{P}'|_{S_{y'} \times \{H^{-1}\}}$, where $\eta(\alpha, H) := (\gamma(\alpha), H^{-1})$, which preserve
the connections. What we are looking for is a family of sections, contained in $\Gamma(\mathcal{I}|_{\Gamma_z \times \{H\}})$ for
every $H$, depending continuously on $H$ and at least homeomorphic to $S^1$. If such a
thing existed, it would form a fibration $E$ over $S^1$ with connected fibres, and hence it
would admit sections. Any section of $E$ would induce a section of $\mathcal{I}$, which is impossible,
because the bundle $\mathcal{I}$ is nontrivial (to see that $\mathcal{I}$ is nontrivial observe that the connections
$\mathbb{A}$ and $\mathbb{A}'$ induce a connection on $\mathcal{I}$ whose holonomy around $\Gamma_z \times \{H\}$ is $H$; this forces
the curvature of this connection to have integral over $\Gamma_z \times S^1$ equal to $-2\pi i$). Of course, this is
a manifestation of the impossibility of extending the universal Jacobian to the whole
Deligne-Mumford moduli space as a fibration of smooth orbifolds. See [Fr] for a more
general discussion.



4. Orbibundles over orbisurfaces 

We will call an orbisurface a smooth complex curve $C$ together with a list of points
$\mathbf{x} = x_1, \ldots, x_n \in C$ and corresponding positive integers $\mathbf{q} = q_1, \ldots, q_n$. Take an
orbisurface $(C, \mathbf{x}, \mathbf{q})$. We now define (in a rather ad hoc way) what an orbibundle over
$(C, \mathbf{x}, \mathbf{q})$ is. (Of course, orbibundles are well known objects and can be defined in much
greater generality than we do; we just recall the definition in this particular case to fix
notations.) Pick a neighbourhood $U_j$ of each point $x_j$ and a holomorphic covering map
$p_j : \tilde U_j \to U_j$ of degree $q_j$, such that $\tilde U_j$ is biholomorphic to the unit disk and $p_j$ has
maximal ramification at the unique point $\tilde x_j := p_j^{-1}(x_j)$ and is unramified everywhere
else. Let $\Gamma_j := \mathbb{Z}/q_j\mathbb{Z}$. The group $\Gamma_j$ acts on $\tilde U_j$ leaving the map $p_j$ invariant. We will
assume that the sets $U_1, \ldots, U_n$ are disjoint. A principal $S^1$ orbibundle over $(C, \mathbf{x}, \mathbf{q})$
is a tuple

$$\mathcal{P} = (P, P_1, \ldots, P_n, \psi_1, \ldots, \psi_n),$$

where $P \to C \setminus \mathbf{x}$ is a principal $S^1$ bundle and, for each $j$, $P_j \to \tilde U_j$ is a $\Gamma_j$-equivariant
principal $S^1$ bundle and $\psi_j : p_j^*P|_{\tilde U_j \setminus \{\tilde x_j\}} \to P_j|_{\tilde U_j \setminus \{\tilde x_j\}}$ is an isomorphism. We leave to
the reader the definition of isomorphism of principal orbibundles.

To define the degree of the orbibundle $\mathcal{P}$, pick a collection of local trivialisations
$\tau = (\tau_1, \ldots, \tau_n) \in T(P, \mathbf{x})$. Each of them gives rise (pulling back through $p_j$ and using
$\psi_j$) to a trivialisation $\tilde\tau_j$ of $P_j$ restricted to $\partial\tilde U_j$. Such a trivialisation allows us to define a
quotient of $P_j$ over $\tilde U_j/\partial\tilde U_j \simeq S^2$, and we define $\deg(P_j, \tilde\tau_j)$ to be the degree of this
quotient. Then

$$\deg \mathcal{P} := \deg P^\tau - \sum_j \frac{1}{q_j} \deg(P_j, \tilde\tau_j).$$

One checks, using (3.13), that this expression does not depend on the choice of $\tau$.

4.1. Relation to meromorphic connections. Suppose that $(C, \mathbf{x})$ is a marked smooth
curve and that $P \to C \setminus \mathbf{x}$ is a principal $S^1$ bundle. Let $A$ be a meromorphic connection
on $P$, all of whose residues are of the form $il$ for $l \in \mathbb{Q}$. Then we obtain in a natural way a
structure of orbisurface on $C$ and an orbibundle $\mathcal{P}^A$ on it which restricts to $P$ on $C \setminus \mathbf{x}$, as
follows. Pick a collection of trivialisations $\tau \in T(P, \mathbf{x})$ and let the residue $\mathrm{Res}(A, x_j, \tau_j)$
be $ip_j/q_j$, where $p_j$ and $q_j \geq 1$ are relatively prime integers. The resulting orbisurface is
then $C^A := (C, \mathbf{x}, \mathbf{q})$, where $\mathbf{q} = (q_1, \ldots, q_n)$, and we define $P_j \to \tilde U_j$ to be the trivial
bundle with the trivial action of $\Gamma_j$. So it remains to define the isomorphisms $\psi_j$ (which
should be a trivialisation of $p_j^*P$, since $P_j$ is the trivial bundle). The residue of the
connection $p_j^*A$ on the bundle $p_j^*P$ at the point $\tilde x_j$ with respect to $p_j^*\tau_j$ is $ip_j \in i\mathbb{Z}$. Then
we take $\psi_j$ to be the trivialisation of $p_j^*P$ with respect to which the covariant derivative
$d_{p_j^*A}$ is equal to $d + \alpha$, where $\alpha$ extends continuously to $\Omega^1(\tilde U_j, i\mathbb{R})$ (such a trivialisation
exists by Lemma 3.1).

It is now a consequence of the Chern-Weil formula in Lemma 3.5 that

$$\deg \mathcal{P}^A = \frac{i}{2\pi} \int_C F_A .$$
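
To see where the rational correction goes, one can substitute the residues into (3.15). The line below is only a sketch of the first half of the argument, with $\mathrm{Res}(A, x_j, \tau_j) = ip_j/q_j$ as above (here $p_j$ denotes the integer, not the covering map).

```latex
\[
\deg P^\tau
 = \frac{i}{2\pi}\int_C F_A + i\sum_j \frac{i\,p_j}{q_j}
 = \frac{i}{2\pi}\int_C F_A - \sum_j \frac{p_j}{q_j}.
\]
```

So the stated formula for $\deg \mathcal{P}^A$ amounts to saying that the local terms in the definition of the degree of an orbibundle contribute $+\sum_j p_j/q_j$, which cancels the rational part of $\deg P^\tau$.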

4.2. Associated bundles. Of course, one can consider in general orbibundles which
are not necessarily principal $S^1$ orbibundles, but which are, more generally, locally
trivial orbibundles. Their definition is completely analogous to that of $S^1$ principal
orbibundle, substituting in each case the words "principal $S^1$ bundle" by "locally trivial
bundle". The notion of section of an orbibundle is also almost evident; so, for example,
if $\mathcal{Y} := (Y, \{Y_j\}, \{\psi_j\})$ denotes a locally trivial orbibundle over an orbisurface $(C, \mathbf{x}, \mathbf{q})$,
then a section of $\mathcal{Y}$ is a collection of sections $\Phi = (\phi, \phi_1, \ldots, \phi_n)$, where $\phi \in \Gamma(C \setminus \mathbf{x}, Y)$
and $\{\phi_j \in \Gamma(\tilde U_j, Y_j)\}$, satisfying the compatibility condition $\psi_j \circ p_j^*\phi = \phi_j$.
This is the main example which we will encounter. Given a principal $S^1$ orbibundle
$\mathcal{P} = (P, \{P_j\}, \{\psi_j\})$ over $(C, \mathbf{x}, \mathbf{q})$, we can define a locally trivial orbibundle $\mathcal{Y} := \mathcal{P} \times_{S^1} X$
on $(C, \mathbf{x}, \mathbf{q})$ by setting $\mathcal{Y} := (Y, \{Y_j\}, \{\bar\psi_j\})$, where $Y = P \times_{S^1} X$, $Y_j = P_j \times_{S^1} X$ and
$\bar\psi_j$ denotes the isomorphism induced by $\psi_j$.

Recall that the de Rham complex of differential forms can be defined for orbifolds,
and that its cohomology is isomorphic to the singular cohomology of the orbifold with
real coefficients. This means in particular that we can use Chern-Weil theory to obtain
representatives of Chern classes of orbibundles (allowing, in particular, to compute their
degree). Another consequence is that one can also define the cohomology class $[\omega(A)]$
exactly as in Section 2.

5. Stable twisted holomorphic maps 

5.1. Critical residues. Let $F \subset X$ be the fixed point set of the action of $S^1$. For each
connected component $F' \subset F$, there is an action of $S^1$ on the corresponding normal
bundle $N \to F'$. Then $N$ splits as a direct sum $N = \bigoplus_{\chi \in \mathbb{Z}} N_\chi$, where $N_\chi \subset N$ is
the subbundle on which $S^1$ acts with weight $\chi$. Define the set of weights of $F'$ to be
$\mathrm{weight}(F') := \{\chi \in \mathbb{Z} \mid N_\chi \neq 0\}$. Define also the set of weights of $X$ to be

$$\mathrm{weight}(X) := \bigcup_{F' \subset F} \mathrm{weight}(F') \subset \mathbb{Z},$$

where the union runs over the set of connected components of $F$. Finally, define the set
of critical residues to be

$$\Lambda_{cr} := \{\lambda \in i\mathbb{R} \mid \text{there is some } w \in \mathrm{weight}(X) \text{ such that } w\lambda \in i\mathbb{Z}\}.$$

For any $\lambda \in i\mathbb{R}$ we will denote

$$X^\lambda := \{x \in X \mid e^{2\pi\lambda} \cdot x = x\}.$$

Of course, for any $\lambda$ we have $F \subset X^\lambda$, and the condition $\lambda \in \Lambda_{cr}$ is equivalent to the
inclusion $F \subset X^\lambda$ being proper.

If $A$ is a meromorphic connection on a principal $S^1$ bundle over a nodal curve $C$ and
$z \in C$ is a node, we will say that the holonomy of $A$ around $z$ is critical if, denoting
by $y$ any preimage of $z$ in the normalisation of $C$, we have $H := \mathrm{Hol}(A, y) \in e^{2\pi\Lambda_{cr}}$
(this is clearly independent of the chosen preimage in the normalisation, and hence well
defined). In other words, the holonomy $H$ is critical if the set of points in $X$ fixed by $H$
is bigger than $F$.
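
As an illustration of the definition of critical residues, the following sketch unwinds the condition $w\lambda \in i\mathbb{Z}$ in terms of the holonomy. The first line uses only that the weights are nonzero integers and $\lambda \in i\mathbb{R}$; the second treats a hypothetical example in which $\mathrm{weight}(X) = \{-1, 1\}$, as for the standard rotation action on the 2-sphere.

```latex
\begin{align*}
e^{2\pi\Lambda_{cr}}
 &= \{\,h\in S^1 \mid h^{w}=1 \text{ for some } w\in\mathrm{weight}(X)\,\},
 \qquad\text{since } (e^{2\pi\lambda})^{w}=e^{2\pi w\lambda}=1 \iff w\lambda\in i\mathbb{Z};\\
\mathrm{weight}(X)=\{-1,1\}
 &\ \Longrightarrow\ \Lambda_{cr}=i\mathbb{Z},\qquad e^{2\pi\Lambda_{cr}}=\{1\}.
\end{align*}
```

In particular, critical holonomies are always roots of unity, and in the second example the only critical holonomy is the trivial one.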

5.2. Local behaviour of holomorphic sections near a marked point. Let $(C, \mathbf{x})$
be a smooth marked curve and let $P$ be a principal $S^1$ bundle over $C \setminus \mathbf{x}$, endowed with
a meromorphic connection $A$. Let $Y := P \times_{S^1} X$, let $x \in \mathbf{x}$ be a marked point, and
denote by $D_\epsilon \subset C$ the disk of radius $\epsilon$ centered at $x$. Following the same ideas as in
Section 3.4 we can define an equivalence relation on the restriction of $Y$ on $D_\epsilon$ using
parallel transport with respect to $A$ along radial directions. Taking the quotient by this
equivalence relation gives rise to a bundle $Y_x$ over $S_x$, which can be canonically identified
with $P_x \times_{S^1} X$.

Given a smooth section $\phi$ of $Y$, we say that $\phi$ extends at $x$ to give a section $\phi_x$
of $Y_x$ if, for every $\theta \in S_x$ we have $\lim_{\delta \to 0}[\phi(\exp_x \delta\theta)] = \phi_x(\theta)$, where the brackets denote
the equivalence class in $Y_x$. In other words, this means that the pullback of the section
$\phi$ to the real blowup of $C$ at $x$ extends to the exceptional divisor.

Theorem 5.1. Suppose that a section $\phi$ of $Y$ satisfies $\bar\partial_A\phi = 0$ and $\|d_A\phi\|_{L^2(C \setminus \mathbf{x})} < \infty$.
Then for every marked point $x \in \mathbf{x}$ the section $\phi$ extends at $x$ to give a section $\phi_x$ of $Y_x$
and we have $d_{A_x}\phi_x = 0$. Furthermore, the section $\phi_x$ takes values in $X^\lambda$, where $\lambda \in i\mathbb{R}$ is
such that $\mathrm{Hol}(A, x) = e^{2\pi\lambda}$. In particular,
if the holonomy of $A$ around $x$ is not critical, then $\phi_x$ takes values in the fixed point set
$F$, hence $\phi_x$ is constant and the following limit exists:

$$\phi(x) := \lim_{z \to x} \phi(z) \in F. \qquad (5.16)$$

The result of Theorem 5.1 is of local nature, so it follows from the corresponding 
version for the punctured disk, which is considered in Corollary 10.2 (see Section 10 for 
the statement and the proof of Corollary 10.2). 

5.3. Chains of gradient segments. Recall that we denote $H := -i\mu$. Let $\xi_t : X \to X$
be the downward gradient flow at time $t$ of $H$, so $\xi_0 = \mathrm{Id}_X$ and

$$\frac{\partial \xi_t}{\partial t} = -\xi_t^*\nabla H = -\xi_t^* I\mathcal{X},$$

where $\mathcal{X}$ is the vector field generated by the infinitesimal action of $i \in \mathrm{Lie}\, S^1$ on $X$.
A pointed gradient segment in $X$ is a pair $(x, T)$, where $x$ is a point in $X$, not
contained in the fixed point set, and $T \subset \mathbb{R}$ is a closed interval of positive measure.
Define $\xi_T(x) := \{\xi_t(x) \mid t \in T\} \subset X$. Two pointed gradient segments $(x, T)$ and
$(x', T')$ are said to be equivalent if $\xi_T(x) = \xi_{T'}(x')$. A gradient segment in $X$ is an
equivalence class of pointed gradient segments. A chain of gradient segments is a
finite sequence $\mathcal{T}$ of gradient segments represented by a list of pointed gradient segments
$((x_1, T_1), \ldots, (x_k, T_k))$ satisfying the following properties:

(1) if $j > 1$ then $\inf T_j = -\infty$, and if $j < k$ then $\sup T_j = \infty$;

(2) if $1 \leq j < k$ then $\lim_{t \to \infty} \xi_t(x_j) = \lim_{t \to -\infty} \xi_t(x_{j+1})$.

The beginning of $\mathcal{T}$ is the point $\lim_{t \to \inf T_1} \xi_t(x_1)$ and the end of $\mathcal{T}$ is $\lim_{t \to \sup T_k} \xi_t(x_k)$. A
degenerate chain of gradient segments is simply a point $x \in X$ (so this corresponds
to the case $(x, T)$ where $T = \{0\}$).

Denote by $\mathcal{T}(X)$ the set of chains of gradient segments on $X$, including the
degenerate ones. The group $S^1$ acts on $\mathcal{T}(X)$ as follows: if $\theta \in S^1$ and $\mathcal{T} \in \mathcal{T}(X)$ is
represented by $((x_1, T_1), \ldots, (x_k, T_k))$ then $\theta \cdot \mathcal{T}$ is represented by $((\theta \cdot x_1, T_1), \ldots, (\theta \cdot
x_k, T_k))$. The set $\mathcal{T}(X)$ carries a natural topology induced by the Hausdorff distance
between subsets of $X$. With this topology, $\mathcal{T}(X)$ is clearly compact.

If $S$ denotes the circle and $P \to S$ is a principal $S^1$ bundle provided with a connection $A$
with trivial holonomy, then we say that a section $\mathcal{T}_S$ of the associated bundle $P \times_{S^1} \mathcal{T}(X)$
is covariantly constant (with respect to $A$) if, given a trivialisation with respect to
which $d_A = d$, the map $S \to \mathcal{T}(X)$ given by the section $\mathcal{T}_S$ is constant.

5.4. Metrics of fixed volume on stable curves. Recall that the Deligne-Mumford
moduli space $\overline{\mathcal{M}}_{g,n}$ of isomorphism classes of stable curves $[C, \mathbf{x}]$ admits a natural
structure of orbifold. The map $f : \overline{\mathcal{M}}_{g,n+1} \to \overline{\mathcal{M}}_{g,n}$ which forgets the last point and stabilises
gives $\overline{\mathcal{M}}_{g,n+1}$ the structure of universal curve over $\overline{\mathcal{M}}_{g,n}$. Let $\mathcal{M}et_{g,n}$ be the space of all
smooth (in the orbifold sense) metrics in $\overline{\mathcal{M}}_{g,n+1}$ whose restriction to the fibre of $f$ over
$[C, \mathbf{x}]$ gives a metric on $C$ of total volume 1 and in the conformal class defined by the
complex structure. For any $\nu \in \mathcal{M}et_{g,n}$ and $[C, \mathbf{x}] \in \overline{\mathcal{M}}_{g,n}$ we will denote by $\nu_{[C,\mathbf{x}]}$ the
induced metric in $C$. If $\pi : C' \to C$ is the normalisation map, the pullback $\pi^*\nu_{[C,\mathbf{x}]}$ is a
smooth metric in $C'$ (this is true because $\nu$ is smooth in $\overline{\mathcal{M}}_{g,n+1}$). Also, $\nu_{[C,\mathbf{x}]}$ is invariant
under the action of the automorphisms of $(C, \mathbf{x})$. We give $\mathcal{M}et_{g,n}$ the obvious topology,
which makes it a contractible space.

5.5. Definition of c-stable twisted holomorphic maps. Let $g$ and $n$ be nonnegative
integers satisfying $2g + n \geq 3$. Take two natural numbers $n_{cr}$ and $n_{ge}$ such that $n_{cr} + n_{ge} =
n$.

Let $(C, \mathbf{x})$ be a nodal curve of genus $g$ and with $n$ marked points. Repeatedly
contracting the unstable components of $C$, we obtain a stable curve $C^{st}$ and a map $s : C \to C^{st}$,
called the stabilisation map. Let $C^b \subset C$ be the union of the irreducible components
which are contracted to a point by the stabilisation map, and let $C^p \subset C$ be the union of
the components not contained in $C^b$. Then we have $C = C^p \cup C^b$. The components of $C^b$
are called the bubble components of $C$, and those of $C^p$ the principal components
of $C$. Finally, an exceptional point of $C$ is a point which is either a marked point or
a node.

Pick a metric $\nu \in \mathcal{M}et_{g,n}$ and an element $c$ of $i\mathbb{R}$. A c-stable twisted holomorphic
map (c-STHM for short) of genus $g$ and $n$ marked points is a tuple

$$\mathcal{C} = ((C, \mathbf{x}_{cr}, \mathbf{x}_{ge}), (P, A, G), \phi, \{\mathcal{T}_y\}, \{\mathcal{T}_x\}),$$

where

(1) $C$ is a connected compact nodal complex curve, $\mathbf{x}_{cr}$ and $\mathbf{x}_{ge}$ are disjoint lists of
smooth points of $C$: $\mathbf{x}_{cr}$ is the list of critical marked points and $\mathbf{x}_{ge}$ is the list
of generic marked points; $\mathbf{x}_{cr}$ contains $n_{cr}$ points and $\mathbf{x}_{ge}$ contains $n_{ge}$ points,
and we denote by $\mathbf{x}$ the union $\mathbf{x}_{cr} \cup \mathbf{x}_{ge}$.

(2) $P$ is a principal $S^1$ bundle on the set of non exceptional points of $C$,

$$P \to C \setminus (\mathbf{x} \cup \mathbf{z}),$$

where $\mathbf{z} \subset C$ is the set of nodes.

(3) $A$ is a meromorphic connection on $P$, $G$ is a choice of gluing data for $P$ and $\phi$ is
a section of the bundle $P \times_{S^1} X$.

(4) For each preimage $y$ of a node in $C$, $\mathcal{T}_y$ is a covariantly constant section of the
bundle $P_y \times_{S^1} \mathcal{T}(X^\lambda)$, where $\lambda \in i\mathbb{R}$ is such that $\mathrm{Hol}(A, y) = e^{2\pi\lambda}$ (this gives a
chain of gradient lines for each tangent direction at $y$ in the normalisation of the
curve, varying in an $S^1$-equivariant way).

(5) For each $x \in \mathbf{x}_{ge}$, $\mathcal{T}_x$ is a covariantly constant section of the bundle $P_x \times_{S^1} \mathcal{T}(X^\lambda)$,
where $\lambda \in i\mathbb{R}$ is such that $\mathrm{Hol}(A, x) = e^{2\pi\lambda}$.

The tuple C must satisfy the following conditions: 

(1) The section is holomorphic. The section satisfies the equation 

d A <p = 0. (5.17) 

(2) Vortex equation. For any principal component Cj C C p , let Vj (resp. Aj, 0j) 
be the restriction of sV[c,x] (resp. A, 0) to Cj, where s is the stabilisation map, 
and let dvo\(i>j) be the induced volume form; then 

^o\{vj)F Ai + (J>(<f>j) = c; (5.18) 

we call this equation the vortex equation because in the case of X = C with the 
action of S 1 of weight 1 this equation coincides with the standard abelian vortex 
equation. 

(3) Flatness on bubbles. The restriction of A to each bubble component is flat. 

(4) Finite energy. The energy of as a section is bounded: 

||d40|| L 2 < 00; (5.19) 

(5) Matching condition at the nodes. Let z G C be a node, and let y, y' be its 

preimages in the normalization map. By Theorem 5.1, (5.17) and (5.19) imply 
that extends to give sections of <p y G T(P y x s i X) and (j) y > G T(P y i x s i X). Let 
p : P y — > P y i be the isomorphism given by the gluing data G. Then 

T y = p*T y/ 

and for every 6 G S y , 4> y {0) has to be either the beginning or the end of the chain 
T y (0), and p*(f> y >{9) has to be the opposite extreme. Furthermore, if the holonomy 
of A around y is not critical, then the chain of gradient segments T y has to be 
degenerate, which implies that p*(p y ' = <f> y . 

(6) Matching condition and the generic marked points. Given $x \in \mathbf{x}_{ge}$, let $\phi_x \in \Gamma(P_x \times_{S^1} X)$ be the extension of $\phi$. For any $\theta \in S_x$, $\phi_x(\theta)$ is either the beginning or the end of $\mathcal{T}_x(\theta)$, and the opposite extreme of $\mathcal{T}_x(\theta)$ is a fixed point. Furthermore, if the holonomy of $A$ around $x$ is not critical then $\mathcal{T}_x$ has to be degenerate.

(7) Stability condition for the bubbles. If $C' \subset C$ is a bubble component with fewer than 3 exceptional points, then the restriction of $d_A\phi$ to $C'$ is not identically zero.

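Two c-STHM's $\mathcal{C}$ and $\mathcal{C}'$ are said to be isomorphic if there is a commuting diagram

$$\begin{array}{ccc} P & \xrightarrow{\ g\ } & P' \\ \downarrow & & \downarrow \\ C & \xrightarrow{\ f\ } & C', \end{array}$$

where $g$ is an isomorphism of principal $S^1$ bundles and $f$ is a biholomorphism satisfying $f(\mathbf{x}_{cr}) = \mathbf{x}'_{cr}$, $f(\mathbf{x}_{ge}) = \mathbf{x}'_{ge}$ and preserving the ordering of the marked points, such that

$g^*A' = A$, $g^*G' = G$, $g^*\phi' = \phi$, $g^*\mathcal{T}'_y = \mathcal{T}_y$ and $g^*\mathcal{T}'_x = \mathcal{T}_x$.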

5.6. Remarks on the definition of Hamiltonian Gromov-Witten invariants. We now make a few comments with the hope of clarifying some of the ingredients appearing in the definition of c-STHM. We recall that the main application we have in mind of the compactness result proved in this paper is the definition of invariants of the manifold $X$ and the Hamiltonian action of $S^1$, using the moduli space of c-STHM's.

First of all, in this paper we have imposed no restriction on $c \in i\mathbb{R}$. However, when
considering the moduli space of c-STHM's we will need to take c away from a discrete 
set of critical values. The invariants obtained from two different choices of c may vary 
if we cross critical values when passing from one choice to the other (this is explained 
in [M], and is similar to the well known phenomenon of wall crossing in gauge theories). 
On the other hand, in order to define c-STHM's we have made a choice of an element $\nu \in \mathcal{M}et_{g,n}$; the invariants which we will construct do not depend on this choice, thanks to a standard cobordism argument and the fact that the set $\mathcal{M}et_{g,n}$ is connected. This is analogous to the fact that Gromov-Witten invariants are independent of the chosen compatible almost complex structure. In our situation, the obtained invariants will also be independent of the $S^1$-invariant and compatible almost complex structure. Finally,
the reason why we distinguish two different kinds of marked points (critical and generic) 
is the following: when constructing the moduli space, we will allow the residue at generic 
marked points to vary, whereas the residue at a critical marked point will always have 
to be critical. The way we define the evaluation map at a marked point will depend on 
whether the point is critical or generic. 



6. Topology on the set of c-STHM 

6.1. Convergence of lines to chains of gradient lines. Let $F \subset X$ be the fixed point set. For any small $\delta > 0$, denote by $F^\delta$ the $\delta$-neighbourhood of $F$, that is, the set of points of $X$ at distance $< \delta$ from $F$. Let also $X^\delta := X \setminus F^\delta$. Finally, recall that we denote by $H$ the function $-i\mu$.

Let $\{f_u : S_u \to X\}$ be a sequence of smooth maps, where each $S_u \subset \mathbb{R}$ is a closed interval. We call the pairs $(f_u, S_u)$ lines in $X$. Let $\mathcal{T}$ be a chain of gradient flow lines (see Section 5.3) represented by a list $((x_1, \mathcal{T}_1), \dots, (x_k, \mathcal{T}_k))$ of pointed gradient segments. Let $F_1, \dots, F_l$ be the connected components of the fixed point set which intersect the closure of $\bigcup_j \mathcal{T}_j(x_j)$, labelled in such a way that $H(F_1) > H(F_2) > \dots > H(F_l)$. We will say that the sequence of lines $\{(f_u, S_u)\}$ converges to $\mathcal{T}$ if for each small enough $\delta > 0$ and any big enough $u$ we can write $S_u$ as a union of two sets

$S_u = T_u^\delta \cup E_u^\delta$ (6.20)

in such a way that:
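(1) each $E_u^\delta$ is a union of $l$ closed intervals: $E_u^\delta = E_{u,1}^\delta \cup \dots \cup E_{u,l}^\delta$ and each $T_u^\delta$ is a union of $k$ closed intervals: $T_u^\delta = T_{u,1}^\delta \cup \dots \cup T_{u,k}^\delta$, labelled in such a way that $\sup E_{u,j-1}^\delta < \inf E_{u,j}^\delta$ and $\sup T_{u,j-1}^\delta < \inf T_{u,j}^\delta$ for every $j$ for which the expression makes sense;

(2) denoting by $t_{u,j}^\delta$ any point of $T_{u,j}^\delta$, the maps $f_u : T_{u,j}^\delta \to X$ approximate gradient segments:

$\lim_{u\to\infty} \sup_{t \in T_{u,j}^\delta} d\bigl(\xi_{t - t_{u,j}^\delta}(f_u(t_{u,j}^\delta)), f_u(t)\bigr) = 0$; (6.21)

(3) the images $f_u(T_{u,j}^\delta)$ approximate in $X^\delta$ the $j$-th gradient segment of $\mathcal{T}$:

$\lim_{\delta\to 0} \lim_{u\to\infty} D\bigl(f_u(T_{u,j}^\delta), \mathcal{T}_j(x_j) \cap X^\delta\bigr) = 0$,

where here $D$ denotes the Hausdorff distance between sets;

(4) the images of the sets $E_{u,j}^\delta$ become smaller and smaller as $\delta \to 0$ and they accumulate near the fixed point set:

$\lim_{\delta\to 0} \bigl(\limsup_{u\to\infty} \mathrm{diam}(f_u(E_{u,j}^\delta))\bigr) = \lim_{\delta\to 0} \bigl(\limsup_{u\to\infty} d(f_u(E_{u,j}^\delta), F_j)\bigr) = 0$.

This implies in particular that the sets $f_u(S_u)$ converge in the Hausdorff metric to the closure of $\bigcup_j \mathcal{T}_j(x_j)$.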

6.2. Convergence with gauge $\lambda$ of cylinders to chains of gradient lines. Now suppose that $\{\phi_u : C_u \to X\}$ is a sequence of smooth maps, where each $C_u$ is a cylinder $C_u = S_u \times S^1$ and $S_u \subset \mathbb{R}$ is a closed interval. We call the pairs $(\phi_u, C_u)$ cylinders in $X$. Let $\lambda \in \Lambda_{cr}$ be a critical residue. We say that the sequence of cylinders $\{\phi_u, C_u\}$ converges with gauge $\lambda$ to a chain of gradient lines $\mathcal{T}$ in $X^\lambda$ if there is a sequence of lines in $X$, $\{(\psi_u, S_u)\}$, which converges to $\mathcal{T}$ and such that

$\lim_{u\to\infty} \sup_{(t,\theta) \in C_u} d\bigl(e^{\lambda\theta}\phi_u(t,\theta), \psi_u(t)\bigr) = 0$.

Here $e^{\lambda\theta}$ denotes any number of the form $e^{\lambda\tilde\theta}$, where $\tilde\theta \in \mathbb{R}$ is a lift of $\theta \in \mathbb{R}/2\pi\mathbb{Z}$. Since the chain $\mathcal{T}$ is contained in $X^\lambda$, the resulting notion of convergence is independent of the chosen lifts.
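The independence of the lift can be checked by a short computation. The sketch below assumes, beyond what is stated in the text, that the distance $d$ on $X$ is $S^1$-invariant; the integer $k$, the lifts $\tilde\theta, \tilde\theta'$ and the points $p, q$ are notation introduced only for this computation.

```latex
% Two lifts of the same angle differ by a multiple of 2\pi:
\tilde\theta' = \tilde\theta + 2\pi k,\quad k \in \mathbb{Z}
\qquad\Longrightarrow\qquad
e^{\lambda\tilde\theta'} = e^{\lambda\tilde\theta}\,\bigl(e^{2\pi\lambda}\bigr)^{k}.
% The element g := (e^{2\pi\lambda})^k fixes X^\lambda pointwise, so for
% p \in X and q \in X^\lambda an S^1-invariant distance satisfies
d(g\,p,\,q) = d(g\,p,\,g\,q) = d(p,\,q).
```

Taking $p = e^{\lambda\tilde\theta}\phi_u(t,\theta)$ and $q$ a point of the chain close to $\psi_u(t)$, the two choices of lift give distances to $\psi_u(t)$ that differ by at most twice the distance from $\psi_u(t)$ to the chain, and this tends to 0 uniformly because the lines $(\psi_u, S_u)$ converge to a chain contained in $X^\lambda$.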

6.3. Description of the topology of $\overline{\mathcal{M}}_{g,n}$. We give here a description of the topology of the Deligne-Mumford moduli space which is suitable for our purposes. For that it suffices to specify what it means that a sequence of stable (pointed) curves converges to a given curve.

Let $(C, \mathbf{x})$ be a stable curve, denote by $\mathbf{z}$ the set of nodes, let $\pi : C' \to C$ be the normalization and let $\mathbf{y}$ be the preimages of $\mathbf{z}$ by $\pi$. Pick a conformal metric on $C'$ and let $\epsilon > 0$ be a small number. For each $y \in \mathbf{y}$, take a neighborhood $U_y \subset C'$, small enough so that it admits a biholomorphism $\zeta_y : D(\epsilon) \to U_y$ with the disk $D(\epsilon) \subset T_yC'$ centered at 0 and of radius $\epsilon$. Choose also a small neighborhood $B_x \subset C'$ of the preimage of each marked point $x \in \mathbf{x}$. Denote by $I \in \mathrm{End}\, TC'$ the complex structure of $C'$, which we now view as a compact real surface.

Let now $I' \in \mathrm{End}\, TC'$ be another complex structure which coincides with $I$ on each $U_y$ and each $B_x$ (call such a complex structure admissible). Take also, for each node $z$ with preimages $y, y'$ in $C'$, an element $\delta_z \in T_yC' \otimes T_{y'}C'$ satisfying $|\delta_z| < \epsilon^2$. We call the collection of numbers $\{\delta_z\}$ smoothing parameters. Define a new curve $C(I', \{\delta_z\})$ as follows: replace the complex structure $I$ by $I'$, then remove, for each pair $y, y'$ of preimages of a node $z$, the sets $\zeta_y(D(|\delta_z|/\epsilon))$ and $\zeta_{y'}(D(|\delta_z|/\epsilon))$ from $C'$, and finally identify, for each pair of elements $u \in D(\epsilon) \setminus D(|\delta_z|/\epsilon)$ and $v \in D(\epsilon) \setminus D(|\delta_z|/\epsilon)$ satisfying $u \otimes v = \delta_z$, the images $\zeta_y(u)$ and $\zeta_{y'}(v)$ (such identifications preserve the complex structure because $I'$ is admissible).

For later use, define for every $z$ the subset $N_y(\delta_z) \subset C(I', \{\delta_z\})$ to be the image by $\zeta_y$ of the annulus $D(\epsilon) \setminus D(|\delta_z|/\epsilon)$ (this is equal to the image by $\zeta_{y'}$ of the corresponding subset of $T_{y'}C'$). The set $N_y(\delta_z)$ is conformally equivalent to the cylinder $[\ln|\delta_z| - \ln\epsilon, \ln\epsilon] \times S^1$. We will say that $y$ is in the side of $\{\ln\epsilon\} \times S^1$, and that $y'$ is in the side of $\{\ln|\delta_z| - \ln\epsilon\} \times S^1$ (if we consider $N_{y'}(\delta_z)$ instead, then the roles are inverted).
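The conformal equivalence with a cylinder, and the inversion of roles, can be made explicit in logarithmic coordinates. The following is a sketch under the assumption that the tangent lines at $y$ and $y'$ are identified with $\mathbb{C}$ so that the gluing relation $u \otimes v = \delta_z$ becomes the product $uv = \delta_z$; the coordinates $(t,\theta)$ are introduced here for illustration.

```latex
% Logarithmic coordinate on the annulus:
u = e^{t + i\theta}, \qquad
|\delta_z|/\epsilon \le |u| \le \epsilon
\iff
\ln|\delta_z| - \ln\epsilon \le t \le \ln\epsilon .
% The gluing relation exchanges the two ends of the cylinder:
|u|\,|v| = |\delta_z|
\;\Longrightarrow\;
\ln|v| = \ln|\delta_z| - t ,
% so t = \ln\epsilon (the side of y) corresponds to
% \ln|v| = \ln|\delta_z| - \ln\epsilon, and vice versa.
% Length of the cylinder:
\ln\epsilon - \bigl(\ln|\delta_z| - \ln\epsilon\bigr)
 = 2\ln\epsilon - \ln|\delta_z| \;\longrightarrow\; \infty
 \quad\text{as } \delta_z \to 0 .
```

In particular the neck becomes arbitrarily long as the smoothing parameter tends to zero, which is the regime in which the chains of gradient segments appear.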

Note that for any compact set $K \subset C \setminus (\mathbf{x} \cup \mathbf{z})$ and small enough smoothing data $\{\delta_z\}$ there is a canonical inclusion $K \to C(I', \{\delta_z\})$, which we will denote by $\iota_K$. (For $\iota_K$ to exist it suffices to take each $\delta_z$ so that $\zeta_y(D(|\delta_z|/\epsilon))$ and $\zeta_{y'}(D(|\delta_z|/\epsilon))$ are disjoint from $K$.)

A sequence $\{(C_u, \mathbf{x}_u)\}$ of stable curves converges to $(C, \mathbf{x})$ if for big enough $u$ there is an admissible complex structure $I_u$, smoothing parameters $\{\delta_{u,z}\}$, and an isomorphism of marked nodal curves

$\xi_u : (C(I_u, \{\delta_{u,z}\}), \mathbf{x}) \to (C_u, \mathbf{x}_u)$, (6.22)

such that $I_u$ converges to $I$ in $C^\infty(\mathrm{End}\, TC')$ and for each node $z$ we have $\delta_{u,z} \to 0$.

The topology on $\overline{\mathcal{M}}_{g,n}$ defined by this notion of convergence coincides with the usual one (see for example Section 9 in [FO], where the topology of $\overline{\mathcal{M}}_{g,n}$ is described in similar terms).

6.4. Connections in balanced temporal gauge. Let $C = [p, q] \times S^1$ be a cylinder, and denote by $(t, \theta)$ the usual coordinates. Let $d_\alpha = d + \alpha$ be a covariant derivative on the trivial principal $S^1$ bundle over $C$. We will say that $\alpha$ is in balanced temporal gauge if it is in temporal gauge, so that $\alpha = a\,d\theta$ for some function $a : C \to i\mathbb{R}$, and furthermore the restriction of $a$ to the middle circle $\{(p+q)/2\} \times S^1$ is constantly equal to some $\lambda \in i\mathbb{R}$, which is called the residue of $\alpha$ (with respect to the trivialization). Any connection on the trivial bundle over $C$ is gauge equivalent to a connection in balanced temporal gauge. Furthermore, since $d\alpha = \frac{\partial a}{\partial t}\, dt \wedge d\theta$, we have the estimate:

(6.23)


6.5. Convergence of c-STHM. Our aim here is to define a topology on the set of isomorphism classes of c-STHM's, specifying as before what it means for a sequence of c-STHM's to converge to a given c-STHM. Before defining the convergence of sequences, we make the observation that the notion of convergence for stable curves given in Section 6.3 makes perfect sense when considering nodal marked curves in general: if $(C, \mathbf{x})$ is a nodal marked curve with $k$ nodes, we can define as before the deformations $C(I_u, \delta_1, \dots, \delta_k)$. (Of course, the topology induced by this notion on the set of isomorphism classes of nodal marked curves is not Hausdorff.)

For simplicity, we will only define convergence of sequences of c-STHM's with smooth 
underlying marked curve and with degenerate chains of gradient segments at generic 
marked points. To pass from this to the general case is routine. 

So let $\{\mathcal{C}_u\}$ be a sequence of c-STHM's. Suppose that $(C_u, \mathbf{x}_u)$ is the smooth marked curve underlying $\mathcal{C}_u$, and that $\mathbf{x}_u = \mathbf{x}_{cr,u} \cup \mathbf{x}_{ge,u}$, $(P_u, A_u)$ is the bundle and connection on $C_u$ (since $C_u$ is smooth there is no gluing data) and $\phi_u$ is the section of $P_u \times_{S^1} X$ (again, since $C_u$ is smooth there are no chains of gradient segments $\mathcal{T}_y$); finally, for each $x \in \mathbf{x}_{ge,u}$ the chain $\mathcal{T}_x$ is degenerate. Let now

$\mathcal{C} = ((C, \mathbf{x}_{cr}, \mathbf{x}_{ge}), (P, A, G), \phi, \{\mathcal{T}_y\}, \{\mathcal{T}_x\})$

be another c-STHM, not necessarily with smooth underlying curve, and let $\mathbf{z} \subset C$ be the nodes of $C$.

We will say that the sequence of isomorphism classes $\{[\mathcal{C}_u]\}$ converges to $[\mathcal{C}]$ if for any exhaustion $K_1 \subset \dots \subset K_l \subset \dots$ of $C \setminus (\mathbf{x} \cup \mathbf{z})$ by compact subsets the following holds.

(1) Convergence of the underlying curves. The curves $(C_u, \mathbf{x}_{cr,u}, \mathbf{x}_{ge,u})$ converge to $(C, \mathbf{x}_{cr}, \mathbf{x}_{ge})$. This implies that there are isomorphisms

$\xi_u : (C(I_u, \{\delta_{u,z}\}), \mathbf{x}_{cr}, \mathbf{x}_{ge}) \to (C_u, \mathbf{x}_{cr,u}, \mathbf{x}_{ge,u})$

such that $I_u \to I$ and $\delta_{u,z} \to 0$. Pulling back everything by $\xi_u$ we can assume that the underlying curve of $\mathcal{C}_u$ is $(C(I_u, \{\delta_{u,z}\}), \mathbf{x}_{cr}, \mathbf{x}_{ge})$.

(2) Convergence of gluing angles. Let $((\gamma_1, \rho_1), \dots, (\gamma_k, \rho_k))$ be the gluing data at each of the nodes of $C$ given by $G$. Since we assume that each $C_u$ is smooth, any gluing parameter $\delta_{u,j}$ is nonzero and hence gives rise to a gluing angle $[\delta_{u,j}] \in \Gamma_{z_j}$. Then, for any $j$ we must have $[\delta_{u,j}] \to \gamma_j$.

(3) Convergence of the connections and sections away from the nodes. For each $K_l$ and any big enough $u$ (so that $\iota_{K_l}$ is defined) there must exist an isomorphism of principal bundles

$\rho_{u,l} : P|_{K_l} \to \iota_{K_l}^* P_u$

such that $\rho_{u,l}^* \iota_{K_l}^* A_u$ converges to $A$ and $\rho_{u,l}^* \iota_{K_l}^* \phi_u$ converges to $\phi$ on $K_l$ as $u$ goes to $\infty$ (here the convergence is assumed to be in $C^\infty$). This implies that the holonomies of $A_u$ around the marked points converge to those of $A$.

(4) Convergence of gluing data. The isomorphisms $\rho_{u,l}$ have to satisfy the following additional condition. Suppose that $r > 0$ is smaller than the $\epsilon$ used in the definition of convergence of stable curves. Take some node $z \in \mathbf{z}$ with preimages $y, y'$. Let $Y(r) := \zeta_y(S(r))$, where $S(r) \subset T_yC'$ is the circle of radius $r$ centered at 0, and define $Y'(r)$ similarly. Suppose that $l$ is big enough so that both $Y(r)$ and $Y'(r)$ are contained in $K_l$ and suppose that $u$ is big enough so that $\iota_{K_l}$ exists. Define a map $T_{u,l}(r)$ by the condition that the following diagram commutes:

$$\begin{array}{ccc} (\iota_{K_l}^* P_u)|_{Y(r)} & \xrightarrow{\ \tau_{A_u}\ } & (\iota_{K_l}^* P_u)|_{Y'(r)} \\ {\scriptstyle \rho_{u,l}}\uparrow & & \uparrow{\scriptstyle \rho_{u,l}} \\ P|_{Y(r)} & \xrightarrow{\ T_{u,l}(r)\ } & P|_{Y'(r)}, \end{array}$$

where $\tau_{A_u}$ denotes the parallel transport along the images by $\zeta_y$ of the radial directions in the annulus $D(\epsilon) \setminus D(|\delta_z|/\epsilon)$. Finally, let $f' : P|_{Y'(r)} \to P_{y'}$ and $f : P|_{Y(r)} \to P_y$ be the natural projections. Then the composition $f' \circ T_{u,l}(r) \circ f^{-1}$ gives an isomorphism between $P_y$ and $P_{y'}$ lifting the isometry between $S_y$ and $S_{y'}$ given by the gluing angle $[\delta_z]$, so it specifies an element $g_{u,l,j}(r) \in \Gamma(P, z)$. The condition is that for every $j$

$\lim_{r\to 0} \bigl(\limsup_{u\to\infty} \limsup_{l\to\infty} d(g_{u,l,j}(r), (\gamma_j, \rho_j))\bigr) = 0$,

where $d$ denotes a fixed distance function defined on $\Gamma(P, z_j)$.
(5) Convergence near the nodes to chains of gradient segments. Fix some node $z \in C$ with preimages $y, y'$ in the normalization. Suppose that the holonomy of $A$ around $y$ is $e^{2\pi\lambda}$ for some $\lambda \in i\mathbb{R}$. We will denote $\delta_u := \delta_{u,z}$. To specify the condition, we will use the cylinders

$N_u := N_y(\delta_u) \simeq [\ln|\delta_u| - \ln\epsilon, \ln\epsilon] \times S^1$

defined in Section 6.3. Define, for any big $\Delta > 0$ and $u$, the cylinder

$N_u(\Delta) = [\ln|\delta_u| - \ln\epsilon + \Delta, \ln\epsilon - \Delta] \times S^1 \subset N_u$.

We distinguish two situations.

(a) If the chain of gradient segments $\mathcal{T}_y$ is degenerate (for example, if the residue $\lambda$ is not critical), then we must have

$\lim_{\Delta\to\infty} \bigl(\limsup_{u\to\infty} \mathrm{diam}(\phi_u(N_u(\Delta)))\bigr) = 0$.

(b) Otherwise, the following must happen: for each big enough $u$ there is a cylinder $M_u \subset N_u$ satisfying two properties. First, for any $\Delta > 0$ and big enough $u$, we must have $M_u \subset N_u(\Delta)$. Then the complement $N_u(\Delta) \setminus M_u$ is the union of two cylinders $M_u^{\Delta,-}$ and $M_u^{\Delta,+}$. The second property is the following.

Suppose that $P_u$ is trivialized on $N_y(\delta_u)$ in such a way that $d_{A_u} = d + \alpha_u$ is in balanced temporal gauge and with residue $\lambda_u$, and that $\lambda_u \to \lambda$. Then the section $\phi_u$ restricts to give a map $\phi_u : N_u \to X$, and we must have:

(i) the images of the sets $M_u^{\Delta,\pm}$ have smaller and smaller diameter:

$\lim_{\Delta\to\infty} \bigl(\limsup_{u\to\infty} \mathrm{diam}_{S^1}(\phi_u(M_u^{\Delta,\pm}))\bigr) = 0$;

(ii) $\lambda_u \neq \lambda$ for big enough $u$, so the number $l_u := -i(\lambda_u - \lambda)$ is nonzero;
(iii) the sequence of cylinders $\{\phi_u/l_u : l_uM_u \to X\}$ converges with gauge $\lambda$ to the chain of gradient segments $\mathcal{T}_y$ (here, if $M_u = [p, q] \times S^1$ then $l_uM_u = [l_up, l_uq] \times S^1$ and $\phi_u/l_u(t, \theta) := \phi_u(t/l_u, \theta)$).
(6) Convergence near generic marked points to chains of gradient segments. This is similar to the previous condition. Let $x \in \mathbf{x}_{ge}$. Let $N$ be a punctured neighborhood of $x$ biholomorphic to $[0, \infty) \times S^1$. Define also $N(\Delta) := [\Delta, \infty) \times S^1$. We distinguish two cases.

(a) If the chain $\mathcal{T}_x$ is degenerate then

$\lim_{\Delta\to\infty} \bigl(\limsup_{u\to\infty} \mathrm{diam}(\phi_u(N(\Delta)))\bigr) = 0$.

(b) Suppose now that $\mathcal{T}_x$ is not degenerate. Let the holonomy of $A$ around $x$ be $e^{2\pi\lambda}$. Then we must have $\lambda \in \Lambda_{cr}$. There must be, for each big enough $u$, a cylinder $M_u = [\Delta_u, \infty) \times S^1$ satisfying:

(i) $\lim_{\Delta\to\infty} \bigl(\limsup_{u\to\infty} \mathrm{diam}_{S^1}(\phi_u([\Delta, \Delta_u] \times S^1))\bigr) = 0$;

(ii) take a trivialisation of $P_u$ with respect to which $d_{A_u} = d + \alpha_u$ is in temporal gauge and the restriction of $\alpha_u$ to $\{\Delta_u\} \times S^1$ is equal to $\lambda_u d\theta$, where $\lambda_u \in i\mathbb{R}$ is a constant, and $\lambda_u \to \lambda$. We must have $\lambda_u \neq \lambda$ for big enough $u$, so the number $l_u := -i(\lambda_u - \lambda)$ is nonzero;

(iii) the sequence of cylinders $\{\phi_u/l_u : l_uM_u \to X\}$ converges with gauge $\lambda$ to the chain of gradient segments $\mathcal{T}_x$.



7. The Yang-Mills-Higgs functional $\mathcal{YMH}_c$

Let $\mathcal{C} = ((C, \mathbf{x}_{cr}, \mathbf{x}_{ge}), (P, A, G), \phi, \{\mathcal{T}_y\}, \{\mathcal{T}_x\})$ be a c-STHM. Let $C^p$ (resp. $C^b$) be the union of the principal components (resp. bubble components) of $C$. Let $s : C \to C^{st}$ be the stabilization map. Take on $C^p$ the conformal metric $s^*\nu_{[C^{st},\mathbf{x}]}$ and extend it to a conformal metric on $C$ by choosing an arbitrary conformal metric on the bubbles. Define

$\mathcal{YMH}_c(\mathcal{C}) := \|F_A\|^2_{L^2(C^p)} + \|\mu(\phi) - c\|^2_{L^2(C^p)} + \|d_A\phi\|^2_{L^2(C)}$.

We call $\mathcal{YMH}_c$ the Yang-Mills-Higgs functional. Its value is independent of the conformal metric chosen in the bubbles, since over the bubbles we only integrate the energy $\|d_A\phi\|^2_{L^2}$, which is conformally invariant. In this section we will compute $\mathcal{YMH}_c(\mathcal{C})$ in terms of topological data and the residues of the meromorphic connection $A$.

7.1. Orbifold structure induced by the critical residues. Let $(C, \mathbf{x})$ be a smooth marked curve, let $P \to C \setminus \mathbf{x}$ be a principal $S^1$ bundle and let $A$ be a meromorphic connection on $P$. We can divide the set of marked points as $\mathbf{x} = \mathbf{x}_{cr,A} \cup \mathbf{x}_{ge,A}$, where $\mathbf{x}_{cr,A}$ (resp. $\mathbf{x}_{ge,A}$) denotes the set of points around which the holonomy of $A$ is critical (resp. noncritical, which we call generic). Since critical residues are of the form $il$ with $l \in \mathbb{Q}$, we can use the construction in Section 4.1 to obtain, for any $\tau \in T(P, \mathbf{x}_{cr,A})$, an orbibundle $P^{\tau,A} = (P^\tau, \{P_j\}, \{\psi_j\})$ on $C^A := (C, \mathbf{x}_{cr,A}, \mathbf{q})$ (where $\mathbf{q}$ denotes the collection of denominators in the set of critical residues of $A$). The following result gives a crucial property of holomorphic sections of $P \times_{S^1} X$ with bounded energy.

Theorem 7.1. Suppose that $\phi$ is a section of $P \times_{S^1} X$ which satisfies $\bar\partial_A\phi = 0$ and $\|d_A\phi\|_{L^2(C \setminus \mathbf{x})} < \infty$. Then, for any $\tau \in T(P, \mathbf{x}_{ge,A})$ the section $\phi$ extends to give a section $\Phi^{\tau,A}$ of the orbibundle $P^{\tau,A}$.

Proof. The fact that $\phi$ extends to the points in $\mathbf{x}_{ge,A}$ follows from Theorem 5.1 (since in this case the limiting orbit has to be a fixed point, there is indeed an extension no matter what trivialisation $\tau$ we choose). The fact that $\phi$ extends to $\mathbf{x}_{cr,A}$ is a consequence of Gromov's removal of singularities theorem. Indeed, pulling back $(A, \phi)$ to $U_j \setminus \{w_j\}$ and applying a suitable gauge transformation $g$, the connection extends to the whole $U_j$, hence defines a complex structure on $U_j \times X$. The resulting map $g \cdot p^*\phi : U_j \setminus \{w_j\} \to U_j \times X$ is complex and has bounded energy, so Gromov's theorem applies (for more details, see the proof of the case of critical weight of Theorem 10.1 in Section 10.4). □

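7.2. Computation of the Yang-Mills-Higgs functional.

Lemma 7.2. Let $(C, \mathbf{x})$ be a smooth marked curve, let $P \to C \setminus \mathbf{x}$ be a principal $S^1$ bundle, let $A$ be a meromorphic connection on $P$ and let $\phi$ be a section of $P \times_{S^1} X$. Suppose that $\bar\partial_A\phi = 0$ and that $\|d_A\phi\|_{L^2} < \infty$. Then, for any $\tau \in T(P, \mathbf{x}_{ge,A})$, we have

$\int (\Phi^{\tau,A})^*[\omega(A)] = \frac{1}{2}\|d_A\phi\|^2_{L^2} - \int F_A\,\mu(\phi) - 2\pi \sum_{x \in \mathbf{x}_{ge,A}} \mathrm{Res}(A, x, \tau_x)\,\mu(\phi(x))$.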

Remark 7.3. Note that, since $A$ might have nontrivial poles, the form $\omega(A)$ is singular on $P^{\tau,A} \times_{S^1} X$. However, one can define the cohomology class $[\omega(A)]$ simply by taking any smooth connection $A'$ and setting $[\omega(A)] := [\omega(A')]$. By Remark 2.2, this is independent of $A'$.

Proof. The proof is similar to that of the Chern-Weil formula for meromorphic connections in Lemma 3.5. For any small $\epsilon > 0$ we modify $A$ near each point $x \in \mathbf{x}_{ge,A}$ as follows: if in a neighbourhood of $x$ we have $d_A = d + \alpha + \lambda d\theta$ (where $\alpha$ extends continuously to $x$ and $\lambda = \mathrm{Res}(A, x, \tau_x)$), then we replace $\lambda d\theta$ by $\eta_\epsilon$, where $\eta_\epsilon$ extends smoothly to $x$, coincides with $\lambda d\theta$ away from the disk $B(x, \epsilon)$ of radius $\epsilon$ centered at $x$, and both the supremum of the norm $|\eta_\epsilon|$ and the integral of $|d\eta_\epsilon|$ on $B(x, \epsilon)$ are bounded above by a constant $C$ independent of $\epsilon$ (for example, one can define $\eta_\epsilon$ to be $r^2\lambda d\theta/\epsilon^2$ in $B(x, \epsilon - \epsilon^2)$). Let $A_\epsilon$ be the resulting smooth connection on $P := P^{\tau,A}$ over the orbisurface $C = C^A$. Then we have for any $\epsilon > 0$

$\int (\Phi^{\tau,A})^*[\omega(A)] = \int (\Phi^{\tau,A})^*[\omega(A_\epsilon)] = \frac{1}{2}\|d_{A_\epsilon}\phi\|^2_{L^2} - \int F_{A_\epsilon}\,\mu(\phi)$.

This follows from formula (2.10) and the fact that (taking local trivialisations for (2.10) to make sense) for any point $x \in C$ and any choice of conformal metric on $C$ we have

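$\phi(x)^*\bigl(\pi_X^*\omega - \pi^*a_\epsilon \wedge \pi_X^*\iota_{\mathcal{X}}\omega\bigr) = \frac{1}{2}\bigl(|\partial_{A_\epsilon}\phi|^2 - |\bar\partial_{A_\epsilon}\phi|^2\bigr)\,\mathrm{dvol}(x)$.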
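To finish the proof, we make two observations. First, that as $\epsilon \to 0$, $\|d_{A_\epsilon}\phi\|^2_{L^2}$ converges to $\|d_A\phi\|^2_{L^2}$. This is a consequence of the bound $|\eta_\epsilon| < C$ and of Theorem 10.1. Second, we similarly have, as $\epsilon \to 0$,

$\int_C F_{A_\epsilon}\,\mu(\phi) \to \int_C F_A\,\mu(\phi) + 2\pi \sum_{x \in \mathbf{x}_{ge,A}} \mathrm{Res}(A, x, \tau_x)\,\mu(\phi(x))$.

Indeed, setting $C_\epsilon := C \setminus \bigcup_{x \in \mathbf{x}_{ge,A}} B(x, \epsilon)$ we have

$\int_C F_{A_\epsilon}\,\mu(\phi) = \int_{C_\epsilon} F_A\,\mu(\phi) + \sum_{x \in \mathbf{x}_{ge,A}} \int_{B(x,\epsilon)} F_{A_\epsilon}\,\mu(\phi)$.

The first integral on the right hand side clearly converges to $\int_C F_A\,\mu(\phi)$. To estimate the second term we compute for any $x \in \mathbf{x}_{ge,A}$

$\int_{B(x,\epsilon)} F_{A_\epsilon}\,\mu(\phi) = \int_{z \in B(x,\epsilon)} F_{A_\epsilon}(z)\,\mu(\phi(x))\,dz + \int_{z \in B(x,\epsilon)} F_{A_\epsilon}(z)\bigl(\mu(\phi(z)) - \mu(\phi(x))\bigr)\,dz$.

The last integral converges to 0 as $\epsilon \to 0$ because $\int_{B(x,\epsilon)} |d\eta_\epsilon| < C$ and because $\mu(\phi(z)) \to \mu(\phi(x))$ as $z \to x$ (this follows from Corollary 10.2). Finally, we compute

$\int_{z \in B(x,\epsilon)} F_{A_\epsilon}(z)\,\mu(\phi(x))\,dz = \int_{z \in B(x,\epsilon)} d\eta_\epsilon\,\mu(\phi(x))\,dz + \int_{z \in B(x,\epsilon)} d\alpha\,\mu(\phi(x))\,dz$

$= \int_{\partial B(x,\epsilon)} \eta_\epsilon\,\mu(\phi(x)) + \int_{z \in B(x,\epsilon)} d\alpha\,\mu(\phi(x))\,dz$

$= 2\pi\,\mathrm{Res}(A, x, \tau_x)\,\mu(\phi(x)) + \int_{z \in B(x,\epsilon)} d\alpha\,\mu(\phi(x))\,dz$,

and the last integral converges to 0 because $d\alpha$ is integrable. □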

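Theorem 7.4. Let $\mathcal{C} = ((C, \mathbf{x}_{cr}, \mathbf{x}_{ge}), (P, A, G), \phi, \{\mathcal{T}_y\}, \{\mathcal{T}_x\})$ be a c-STHM. We have $\mathbf{x}_{ge,A} \subset \mathbf{x}_{ge}$ and, for any $\tau \in T(P, \mathbf{x}_{ge,A})$,

$\mathcal{YMH}_c(\mathcal{C}) = 2\int (\Phi^{\tau,A})^*[\omega(A)] - 4\pi i c \deg P^{\tau,A} + 4\pi \sum_{x \in \mathbf{x}_{ge}} \mathrm{Res}(A, x, \tau_x)\bigl(\mu(\phi(x)) - c\bigr)$.
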

Proof. The inclusion $\mathbf{x}_{ge,A} \subset \mathbf{x}_{ge}$ follows from the definition of c-STHM's. So we only have to prove the formula for $\mathcal{YMH}_c(\mathcal{C})$. For that we can compute separately the integral on each principal component $C_j \subset C^p$ (the computation for the bubbles is as in standard Gromov-Witten theory, except the case in which the connection is flat but has poles in two points of the bubble, which follows from Lemma 7.2). Using Lemma 7.2 and the Chern-Weil formula in Lemma 3.5 we have (here all integrals are over $C_j$, $\mathbf{y}_{ge}$ denotes the marked points and nodes of $C_j$ on which $A$ has a pole with noncritical residue, $\nu$ denotes the volume form, and all norms are $L^2$ over $C_j$):

$0 = \|\iota_\nu F_A + \mu(\phi) - c\|^2$

$= \|F_A\|^2 + \|\mu(\phi) - c\|^2 - 2\int F_A\,c + 2\int F_A\,\mu(\phi)$

$= \|F_A\|^2 + \|\mu(\phi) - c\|^2 + 4\pi i c\Bigl(\deg P^{\tau,A} - i \sum_{x \in \mathbf{y}_{ge}} \mathrm{Res}(A, x, \tau_x)\Bigr)$

$\quad + \|d_A\phi\|^2 - 4\pi \sum_{x \in \mathbf{y}_{ge}} \mathrm{Res}(A, x, \tau_x)\,\mu(\phi(x)) - 2\int (\Phi^{\tau,A})^*[\omega(A)]$.

Rearranging the terms we obtain the desired formula. □
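The rearrangement can be spelled out as follows. This is only a sketch: it assumes the expansion of the squared norm of the vortex equation on a principal component in the form suggested by Lemma 7.2 and Lemma 3.5, and the symbol $S$ is shorthand introduced here for the sum of the three norms, not notation from the paper.

```latex
% Shorthand (introduced for this sketch):
S := \|F_A\|^2 + \|\mu(\phi) - c\|^2 + \|d_A\phi\|^2 .
% Using 4\pi i c \cdot (-i) = 4\pi c, the expansion reads
\begin{aligned}
0 &= S + 4\pi i c\,\deg P^{\tau,A}
      + 4\pi c \sum_{x \in \mathbf{y}_{ge}} \mathrm{Res}(A,x,\tau_x)
      - 4\pi \sum_{x \in \mathbf{y}_{ge}} \mathrm{Res}(A,x,\tau_x)\,\mu(\phi(x))
      - 2\int (\Phi^{\tau,A})^*[\omega(A)], \\
S &= 2\int (\Phi^{\tau,A})^*[\omega(A)]
      - 4\pi i c\,\deg P^{\tau,A}
      + 4\pi \sum_{x \in \mathbf{y}_{ge}} \mathrm{Res}(A,x,\tau_x)\bigl(\mu(\phi(x)) - c\bigr).
\end{aligned}
```

Summing the second line over the principal components, and adding the bubble contributions, gives an expression of the same shape as the formula in Theorem 7.4.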

8. Bounding the number of bubbles in terms of $\mathcal{YMH}_c$

Theorem 8.1. For any $K > 0$ and any $g, n$ satisfying $2g + n \geq 3$ there exists some $N$ with the following property. Let $\mathcal{C} = ((C, \mathbf{x}), (P, A, G), \phi, \{\mathcal{T}_y\}, \{\mathcal{T}_x\})$ be a c-STHM of genus $g$ and $n$ marked points. Suppose that $\mathcal{YMH}_c(\mathcal{C}) \leq K$. Then the number of bubbles in $C$ is less than $N$.

The proof of Theorem 8.1 is given in Section 8.3. Note that the corresponding result 
in Gromov-Witten theory is an immediate consequence of the existence of a lower bound 
on the energy of nontrivial bubbles. In our situation such a lower bound does not exist, 
and this is why the theorem is not so obvious. The idea of the proof is the observation 
that bubbles with very little energy come in fact from gradient segments, and that a 
chain of consecutive bubbles with little energy gives rise to a chain of gradient segments, 
with at least as many components as the chain of bubbles. The number of components 
in a chain of gradient segments is at most equal to the number of connected components 
of the fixed point set of X. Hence, there is an upper bound for the length of a chain of 
consecutive bubbles with little energy. 

8.1. Connecting bubbles and tree bubbles. Take some c-STHM $\mathcal{C}$, let $(C, \mathbf{x})$ be the corresponding marked curve, let $P$ be the principal bundle, and let $A$ be the meromorphic connection. Let $\Gamma := \Gamma(C, \mathbf{x})$ be the graph whose set of vertices is the set of irreducible components of $C$ plus the set of marked points, and whose set of edges is the following:

• for any pair of vertices $v', v''$ corresponding to components $C', C''$, there are as many edges connecting $v'$ to $v''$ as there are nodes in $C$ at which $C'$ and $C''$ meet (in particular, $\Gamma$ has a loop for each node whose two branches belong to the same component);

• if $v_x$ is a vertex corresponding to a marked point $x \in \mathbf{x}$, then there is an edge connecting $v_x$ to the vertex corresponding to the component of $C$ in which $x$ is contained.



We divide the set of vertices of $\Gamma$ into bubble vertices, principal vertices and marked point vertices. A bubble vertex $v$ of $\Gamma$ is said to be exterior if there is a unique edge having $v$ as one of its extremes (such an edge is not allowed to be a loop). A bubble vertex $v$ of $\Gamma$ is called a tree vertex if there exists a saturated subgraph $T \subset \Gamma$ such that: $v$ is a vertex of $T$, $T$ is a tree, all vertices of $T$ are bubble vertices, and there is a unique edge connecting a vertex of $T$ to a vertex of $\Gamma \setminus T$. For example, an exterior vertex is a tree vertex. A bubble vertex which is not a tree vertex is called a connecting vertex. We define the depth of a tree vertex $v$ to be the minimal $p$ for which there is a sequence of tree vertices $v = v_1, \dots, v_p$ such that $v_p$ is exterior and each $v_j$ is connected by an edge to $v_{j+1}$.

A chain of connecting vertices is a connected saturated subgraph $R \subset \Gamma$ all of whose vertices are connecting vertices. The name is motivated by the fact that such a graph is necessarily homeomorphic to a segment, as the reader can easily check.

We say that a bubble $C' \subset C$ is a tree bubble if the corresponding vertex in $\Gamma$ is a tree vertex. Similarly, we define the connecting bubbles. Note that a tree bubble cannot contain any marked point.
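The classification of bubble vertices can be illustrated computationally. The sketch below is not taken from the paper: it assumes that tree vertices can be found by repeatedly pruning bubble vertices of degree 1 (which matches the definition whenever the graph contains at least one vertex that is not a tree vertex), and the function name `classify_bubbles` and the input encoding are hypothetical.

```python
from collections import defaultdict


def classify_bubbles(kinds, edges):
    """Label each bubble vertex as 'exterior', 'tree' or 'connecting'.

    kinds: dict mapping a vertex to 'bubble', 'principal' or 'marked'.
    edges: list of (u, v) pairs; repeated pairs and loops (u == v) are allowed.
    Assumption (ours): tree vertices are exactly the bubble vertices removed
    by iterated pruning of degree-1 bubble vertices.
    """
    degree = defaultdict(int)
    neighbours = defaultdict(list)
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1  # a loop therefore adds 2 to the degree of its vertex
        neighbours[u].append(v)
        neighbours[v].append(u)

    # Exterior vertices: bubbles that are an extreme of exactly one edge.
    exterior = {v for v, k in kinds.items() if k == "bubble" and degree[v] == 1}

    pruned = set()
    stack = list(exterior)
    while stack:
        v = stack.pop()
        if v in pruned:
            continue
        pruned.add(v)
        for w in neighbours[v]:
            if w in pruned:
                continue
            degree[w] -= 1
            # A bubble left with a single edge hangs off the rest by one edge.
            if kinds[w] == "bubble" and degree[w] == 1:
                stack.append(w)

    labels = {}
    for v, k in kinds.items():
        if k != "bubble":
            continue
        if v in exterior:
            labels[v] = "exterior"
        elif v in pruned:
            labels[v] = "tree"
        else:
            labels[v] = "connecting"
    return labels
```

For instance, take a principal component `P` joined to a marked point `x` through two bubbles `B1`, `B2`, with a small tree `B3`, `B4`, `B5` attached to `B1`: the function labels `B1` and `B2` as connecting, `B3` as a tree vertex, and `B4`, `B5` as exterior (which are in particular tree vertices).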

Lemma 8.2. If $C' \subset C$ is a tree bubble, then the restriction of $A$ to $C'$ has trivial holonomy around any node contained in $C'$. Hence $(P, A)$ extends smoothly to $C'$.

Proof. For the first assertion, apply induction on the depth of tree vertices, using the fact that $F_A$ restricts to 0 on any bubble and the Chern-Weil formula in Lemma 3.5. The second statement follows from Corollary 3.2. □

Lemma 8.3. Let $C_1, \dots, C_u$ be a sequence of connecting bubbles corresponding to a chain of connecting vertices $R \subset \Gamma$, labelled in such a way that each $C_j$ shares a node with $C_{j+1}$. There exists some $\lambda \in i\mathbb{R}$ with the following property. Let the exceptional points of $C_j$ be $y_+, y_-, z_1, \dots, z_s$, where $y_\pm$ is the node which $C_j$ has in common with $C_{j\pm 1}$, except that if $j = 1$ then $y_-$ is either a marked point or a node which $C_1$ shares with a principal component or another connecting bubble, or if $j = s$ then $y_+$ may be either a marked point or a node (connecting $C_s$ to a principal component or a connecting bubble). Then: each $z_t$ is a node which $C_j$ shares with a tree bubble, the holonomy of $A$ around any $z_t$ is trivial, and the holonomy of $A$ around $y_\pm$ is $e^{\pm 2\pi\lambda}$.

Proof. Let $R \subset \Gamma$ be a maximal chain of connecting vertices. Any vertex of $\Gamma$ which is connected by an edge to an interior vertex of $R$ is necessarily a tree bubble. Furthermore, each of the two vertices at the extremes of $R$ is connected either to a marked point vertex or to a principal vertex, and all other vertices connected to them and not contained in $R$ are tree vertices. This explains why the exceptional points of $C_j$ can be labelled as $y_+, y_-, z_1, \dots, z_s$ ($s$ depends on $j$) and have the properties claimed in the statement of the lemma. By Lemma 8.2 the holonomy of $A$ around $z_t$ is trivial. So $A$ can have nontrivial holonomy only around $y_\pm$. Finally, taking any $\lambda$ such that for $C_1$ the holonomy around $y_-$ is $e^{-2\pi\lambda}$, we deduce that the holonomy around the point $y_\pm \in C_j$ is $e^{\pm 2\pi\lambda}$ (use induction on $j$). □

8.2. Twisted bubbles. Let $\mathbf{y} \subset S^2$ be a finite subset. We call a twisted bubble over $(S^2, \mathbf{y})$ a triple $(P, A, \phi)$ consisting of a principal bundle $P \to S^2 \setminus \mathbf{y}$, a flat meromorphic connection $A$ on $P$ and a section $\phi$ of $P \times_{S^1} X$ satisfying $\bar\partial_A\phi = 0$. We say that a twisted bubble $(P, A, \phi)$ is trivial if the covariant derivative $d_A\phi$ is identically zero.

Let $\epsilon > 0$ be a small number. Let $\Lambda^\epsilon_{cr} \subset i\mathbb{R}$ be the set of residues at distance $< \epsilon$ from the set of critical residues $\Lambda_{cr}$. Suppose that $\epsilon$ is small enough so that for any $\lambda \in \Lambda^\epsilon_{cr}$ there is a unique critical residue $cr(\lambda) \in \Lambda_{cr}$ lying in the same connected component as $\lambda$. In this case, define

$\Lambda^+ = \{\lambda \in \Lambda^\epsilon_{cr} \mid -i\lambda > -i\,cr(\lambda)\}$, $\Lambda^- = \{\lambda \in \Lambda^\epsilon_{cr} \mid -i\lambda < -i\,cr(\lambda)\}$.

We then have a partition $\Lambda^\epsilon_{cr} = \Lambda_{cr} \cup \Lambda^+ \cup \Lambda^-$. Define the set of generic residues to be

$\Lambda_{ge} := i\mathbb{R} \setminus \Lambda^\epsilon_{cr}$.

Theorem 8.4. Let $\mathbf{y} = \{y_+, y_-\}$ consist of two points. There exists some $\epsilon > 0$ with the following property. Suppose that $(P, A, \phi)$ is a twisted bubble on $(S^2, \mathbf{y})$ satisfying $\|d_A\phi\|_{L^2} < \epsilon$. Take some local trivialization of $P$ around $y_+$ and let $\lambda$ be the corresponding residue of $A$ at $y_+$. Then, if $\epsilon$ is small enough, we have

(1) if $\lambda \in \Lambda_{cr}$ then $(P, A, \phi)$ is trivial;

(2) otherwise, by Theorem 5.1 the following limits exist

$\phi(y_+) := \lim_{z \to y_+} \phi(z) \in F$, $\phi(y_-) := \lim_{z \to y_-} \phi(z) \in F$;

let $F_\pm$ be the connected component of $F$ containing $\phi(y_\pm)$. Then we have

(a) if $\lambda \in \Lambda^+$ then $H(F_+) \leq H(F_-)$, with equality only if $(P, A, \phi)$ is trivial,

(b) if $\lambda \in \Lambda^-$ then $H(F_+) \geq H(F_-)$, with equality only if $(P, A, \phi)$ is trivial,

(c) if $\lambda \in \Lambda_{ge}$ then $(P, A, \phi)$ is trivial.

Proof. Suppose first that $\lambda \in \Lambda_{cr}$, and write $\lambda = ip/q$ for relatively prime integers $p, q$. Take a covering $\pi : S^2 \to S^2$ of degree $q$ ramified at $y_+$ and $y_-$, in such a way that $\pi^*A$ has trivial holonomy around $y_+$ and $y_-$. It follows from Corollary 3.2 that both $P$ and $A$ extend to give a bundle $P' \to S^2$ with a smooth connection $A'$. Then $A'$ is flat, so trivializing any fibre of $P'$ we obtain a trivialization of the whole bundle $P' \simeq S^2 \times S^1$, with respect to which the connection $A'$ is trivial. This induces in particular a trivialization of $P$, so we can view the section $\phi$ as a map $\phi : S^2 \setminus \mathbf{y} \to X$, and as such it is $J$-holomorphic and has bounded energy. It follows from Gromov's removal of singularities theorem that $\phi$ extends to give a map $\Phi : S^2 \to X$. On the other hand, $\|d\Phi\|_{L^2} < \epsilon$, and if $\epsilon$ is small enough this implies that $\Phi$ is constant (this is a standard result in Gromov-Witten theory). Hence $d_A\phi = 0$, so $(P, A, \phi)$ is trivial.

For the remaining cases, fix a conformal isomorphism between $S^2 \setminus \mathbf{y}$ and the cylinder $\mathbb{R} \times S^1$, and take in the latter the standard coordinates $(t, \theta)$, in such a way that as $t$ goes to $\pm\infty$ we approach the point $y_\pm$.

Consider first the case $\lambda \in \Lambda^+$. Suppose that $\epsilon$ is very small, and let $(P, A, \phi)$ be a bubble satisfying: $\|d_A\phi\|_{L^2} < \epsilon$, $d_A = d + \lambda d\theta$ in some trivialisation of $P$, and $-i(\lambda - \lambda_{cr}) \in (0, \epsilon)$. Let $\alpha := \lambda d\theta$. In the rest of the proof we are going to use the notation introduced in Section 10. Using (2) in Theorem 10.5, we can assume that both $\|d_\alpha\phi\|_{L^\infty}$ and $\|\alpha - \lambda_{cr}d\theta\|_{L^\infty}$ are less than the $\epsilon$ in Theorem 11.1. By Theorem 11.1 there is some $\psi : \mathbb{R} \to X^{\lambda_{cr}}$ and some $\phi_0 : \mathbb{R} \times S^1 \to TX$ such that $\phi = \exp_\psi \phi_0$. Now, Theorem 11.3 together with $d\alpha = 0$ implies that $|\psi' + i\lambda I\mathcal{X}(\psi)| < K e^{\sigma(|t| - N)}$ for every $N$. Making $N \to \infty$ we deduce that $\psi' = -i\lambda I\mathcal{X}(\psi)$, so $\psi$ follows a downward gradient line of $H$. Consequently, either $H(F_+) < H(F_-)$ or $H(F_+) = H(F_-)$, and in the latter case $\psi$ is constant. Now, using (11.61) in Theorem 11.1 and taking $N$ bigger and bigger as above we deduce that $\phi_0 = 0$. It follows that, if $H(F_+) = H(F_-)$, then $\phi$ is constant.

The case $\lambda \in \Lambda^-$ is proved in the same way. Finally, if $\lambda \in \Lambda_{ge}$ and $\epsilon$ is small enough, then we can use (10.34) in Theorem 10.3 to deduce that the image of $\phi$ is contained in a small and $S^1$-invariant ball $B$ centered at $\phi(y_+) \in F$. Similarly, for any $\tau \in T(P, \mathbf{y})$ we have $\Phi^{\tau,A}(S^2) \subset P \times_{S^1} B$. The equivariant cohomology of $B$ comes entirely from the classifying space. Picking $\tau$ such that $\deg P^\tau = 0$ we deduce that $(\Phi^{\tau,A})^*[\omega(A)] = 0$. This choice of $\tau$ ensures that $\mathrm{Res}(A, y_+, \tau) + \mathrm{Res}(A, y_-, \tau) = 0$. Also, $\phi(y_-)$ belongs to the same connected component of $F$ as $\phi(y_+)$, so $\mu(\phi(y_+)) = \mu(\phi(y_-))$. Hence, applying Lemma 7.2 to $(P, A, \phi)$ we deduce that $\|d_A\phi\|_{L^2} = 0$, so the bubble is trivial. □

8.3. Proof of Theorem 8.1. Let $\mathcal{C} = ((C, \mathbf{x}), (P, A, G), \phi, \{T_y\}, \{T_x\})$ be a c-STHM of
genus $g$ and $n$ marked points satisfying $\mathcal{YMH}_c(\mathcal{C}) < K$. A bubble $C_0 \subset C$ is said to
be unstable if it contains fewer than 3 exceptional points. Let $\epsilon$ be as in Theorem 8.4.
(Note that a particular consequence is that if $\psi : S^2 \to X$ is an $I$-holomorphic map with
energy less than $\epsilon$, then $\psi$ is trivial; this corresponds to the case of trivial holonomy
around $y_\pm$ in the lemma.)

Let $\Gamma$ be the graph associated to $(C, \mathbf{x})$. All subgraphs of $\Gamma$ which we shall mention
will be saturated. The subgraph of $\Gamma$ consisting of connecting vertices can be written
as the disjoint union of the set of maximal chains of connecting vertices $R_1, \dots, R_l$.
Furthermore, $l$ is at most equal to the number of marked points $\mathbf{x}$ plus the number of
nodes of $C$ (because the bubbles corresponding to vertices in $R_j$ can be identified with
the connecting bubbles which are contracted by the stabilisation map either to a given
marked point or to a node). Hence, $l \le 3g - 3 + n$.

Each tree vertex of $\Gamma$ belongs to a maximal tree $T \subset \Gamma$, all of whose vertices are
tree vertices. Hence, the subgraph of $\Gamma$ consisting of tree vertices is the union of trees
$T_1, \dots, T_r$. Each such tree $T_j$ has a distinguished vertex, which we call the root, which
is connected by an edge to a vertex of $\Gamma \setminus T_j$. If $T$ is a tree, we say that a vertex of $T$
is stable (resp. unstable) if its degree is $\ge 3$ (resp. $\le 2$). Let $|T|$ denote the number of
vertices of $T$. Then the number of unstable vertices of $T$ is at least $(|T| + 2)/3$. This
follows from estimating the number of stable vertices by counting edges: $T$ has $|T| - 1$
edges and each stable vertex contributes at least 3/2 to the total number of edges. Now,
an unstable vertex of a tree $T_j$ corresponds to an unstable bubble of $C$, unless the vertex
is the root of $T_j$ and it has degree 2 (hence its degree as a vertex of $\Gamma$ is 3). In this case,
the root cannot be the unique unstable vertex of $T_j$, so there is at least one unstable
bubble for each tree $T_j$. Since each such bubble is nontrivial, it contributes at least $\epsilon$ to
the total energy $\mathcal{YMH}_c(\mathcal{C})$. Hence, the number of trees can be bounded as $r < \epsilon^{-1}K$.
On the other hand, the previous arguments tell us that for each $T_j$ there are at least
$(|T_j| - 1)/3$ unstable bubbles (we are subtracting here a unit in case the root does not
correspond to an unstable bubble). Hence we have $\sum_{j=1}^r (|T_j| - 1)/3 < \epsilon^{-1}K$, which
combined with the bound on $r$ yields $\sum_{j=1}^r |T_j| \le 4\epsilon^{-1}K$. Hence, the number of tree
bubbles in $C$ is bounded.
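
The counting bound used above (a tree $T$ has at least $(|T| + 2)/3$ vertices of degree at most 2) can be sanity-checked numerically. The following is only an illustrative sketch and not part of the original argument; the function names and the random tree model (each new vertex attached to a uniformly chosen earlier one) are assumptions made for the example.

```python
import random


def random_tree_degrees(n, rng):
    """Return the degree sequence of a random tree on n vertices.

    Vertex k (k >= 1) is attached to a uniformly chosen earlier vertex,
    which always produces a connected graph with n - 1 edges, i.e. a tree.
    """
    degrees = [0] * n
    for k in range(1, n):
        parent = rng.randrange(k)
        degrees[k] += 1
        degrees[parent] += 1
    return degrees


def count_unstable(degrees):
    """Count vertices of degree <= 2 (the 'unstable' vertices of the text)."""
    return sum(1 for d in degrees if d <= 2)


def check_bound(n, trials=200, seed=0):
    """Check 3 * #unstable >= |T| + 2 on `trials` random trees with n vertices."""
    rng = random.Random(seed)
    return all(
        3 * count_unstable(random_tree_degrees(n, rng)) >= n + 2
        for _ in range(trials)
    )
```

Calling `check_bound(n)` for a range of sizes exercises the inequality on sampled trees only; the proof is the edge count given in the text.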

It only remains to prove that the length of any chain of connecting vertices of $\Gamma$ is
bounded by a constant independent of $\mathcal{C}$. Let $R \subset \Gamma$ be any such chain. A vertex of $R$
will be said to be stable if its degree as a vertex of $\Gamma$ is $\ge 3$, and unstable otherwise.
Each interior vertex of $R$ which is stable is connected by an edge to at least one of the
trees $T_j$. Since each tree $T_j$ is connected by an edge to a unique vertex in $\Gamma \setminus T_j$, it
follows that the number of interior stable vertices in $R$ is at most $\epsilon^{-1}K$. So if we prove
that the length of each sequence of consecutive unstable vertices of $R$ is bounded, it will follow that
the number of vertices of $R$ is bounded. Now, this follows from combining Lemma 8.3
with Theorem 8.4, and the fact that the number of connected components of $X$ is finite.

9. Main theorem: compactness 

The following is the main result of the paper. 

Theorem 9.1. Let $g$ and $n$ be nonnegative integers satisfying $2g + n \ge 3$. Let $K > 0$ be
any number, and let $c \in i\mathbb{R}$. Let $\{\mathcal{C}_u\}$ be a sequence of c-stable twisted holomorphic maps
of genus $g$ and with $n$ marked points, satisfying $\mathcal{YMH}_c(\mathcal{C}_u) < K$ for each $u$. Then there
is a subsequence $\{[\mathcal{C}_{u_j}]\}$ converging to the isomorphism class of another c-stable twisted
holomorphic map $\mathcal{C}$. Furthermore, we have

$\lim_{j \to \infty} \mathcal{YMH}_c(\mathcal{C}_{u_j}) = \mathcal{YMH}_c(\mathcal{C})$. (9.24)

The rest of this section is devoted to the proof of the theorem. Many ideas involved
in the proof are the same as those that appear in the compactness theorem for stable maps in
Gromov-Witten theory, with which we will assume the reader is familiar (see for
example [T, RT1, IS, FO]). First of all, note that if a subsequence satisfies $\mathcal{C}_{u_j} \to \mathcal{C}$ then,
by Theorem 7.4, (9.24) holds automatically.

9.1. Getting the first limiting curve. Let $(C_u, \mathbf{x}_u)$ be the nodal marked curve underlying
$\mathcal{C}_u$, so that $\mathbf{x}_u$ is the union of the critical points $\mathbf{x}_{cr,u}$ and the generic ones $\mathbf{x}_{ge,u}$.
By Theorem 8.1 the number of bubbles in each $C_u$ is uniformly bounded. It follows that
we can assume (passing to a subsequence) that all curves $C_u$ have the same topological
type. For any $u$, let $\mathbf{x}^0_u \subset C_u$ be a list of points such that, setting $\mathbf{x}'_u := \mathbf{x}_u \cup \mathbf{x}^0_u$, the
marked curve $(C_u, \mathbf{x}'_u)$ is stable, and suppose that each $\mathbf{x}^0_u$ has as few elements as possible.
Taking a subsequence, we can assume that $(C_u, \mathbf{x}'_u)$ converges to a stable curve $(C, \mathbf{x}')$.
The new set of marked points $\mathbf{x}'$ contains the limit $\mathbf{x}$ of the sequences $\mathbf{x}_u \subset \mathbf{x}'_u$. We call
the points in $\mathbf{x}$ original marked points. It is necessary to make this distinction because
the connection may have poles in the original marked points, whereas in the other
marked points it will always be smooth. We remark also, to avoid confusion, that in the
course of the proof the lists of marked points $\mathbf{x}'_u$ and $\mathbf{x}'$ will increase, and that the curve
$C$ will change from time to time (the changes will be addition of rational components).
In the names of the following three sections (where we describe how bubbles are to be
added to the limit curve $C$) we use the terminology of Section 8.1.

9.2. Adding tree bubbles, first part. We first consider bubbling off away from nodes
and marked points. Following the approach in [FO], we force the appearance of bubbles
in the domains $(C_u, \mathbf{x}'_u)$ by adding new marked points near points where $|d_{A_u}\phi_u|$ blows
up. (In order to have a choice of Riemannian metric on each stable curve, which we
need to define $|d_{A_u}\phi_u|$, we choose an element in $\mathcal{M}et_{g,n'}$, where $n'$ is the number of
points in $\mathbf{x}'$, see Section 5.4. Later on we will use metrics on stable curves with more
than $n'$ marked points, and it will be implicitly assumed that we have chosen an element
of the corresponding space $\mathcal{M}et$.) Suppose that $K \subset C$ is a compact set which does not
contain any original marked point nor any node. Since $(C_u, \mathbf{x}'_u)$ converges to $(C, \mathbf{x}')$, we
can assume, provided $u$ is big enough, that there is a canonical inclusion $\iota_K : K \to C_u$,
whose image we call $K_u$ (see Section 6.3). Assume that $\sup_{K_u} |d_{A_u}\phi_u|$ is not bounded
as $u$ goes to infinity. Then we pick for each $u$ a point $x^1_u \in K_u$ where $|d_{A_u}\phi_u|$ attains
its supremum, which we denote by $s_u$, and another point $x^2_u$ at distance $s_u^{-1}$ from $x^1_u$.
We add $x^1_u$ and $x^2_u$ to the list $\mathbf{x}'_u$, and obtain a new marked nodal curve still denoted by
$(C_u, \mathbf{x}'_u)$. Passing to a subsequence, we can assume that $\{(C_u, \mathbf{x}'_u)\}$ converges to another
limit curve $(C, \mathbf{x}')$. Then we repeat the process: we take a compact set $K \subset C$ not
containing original marked points nor nodes, and so on. The process stops when,
for any such $K$, the sequence $\sup_{K_u} |d_{A_u}\phi_u|$ is bounded. That the process stops after
adding a finite number of points is proved exactly as in Gromov-Witten theory (see for
example Proposition 11.3 in [FO]): namely, each time we add two points, there appears
a new bubble, and each bubble which appears contributes more than a certain amount
$\epsilon > 0$ to the energy $\mathcal{YMH}_c(\mathcal{C}_u)$, so there cannot be infinitely many. The key point here
is that each $A_u$ is smooth on $K_u$ and has bounded curvature. Hence, when we zoom in
the connections become more and more flat (this is because we are in real dimension 2),
and in the limit they become trivial. So the bubbles which appear in this situation are
actually holomorphic maps in the usual sense.

9.3. Adding tree bubbles, second part. Suppose that the limit curve $(C, \mathbf{x}')$ has $k$
nodes, so that the curves of the form $(C_u, \mathbf{x}'_u)$ can be identified with $C(I', \delta_{u,1}, \dots, \delta_{u,k}, \mathbf{x}')$.
Now take some $j$ such that $\delta_u := \delta_{u,j}$ is nonzero for big enough $u$. We say in this case that
the node is new, i.e., it does not appear in the curves $C_u$ if $u$ is big. Let $N_u := N_j(\delta_{u,j})$
be the neck defined in Section 6.3. We fix an isomorphism

$N_u \simeq [\ln|\delta_u| - \ln\epsilon, \ln\epsilon] \times S^1 =: [p_u, q_u] \times S^1$

and we denote as usual by $t, \theta$ the standard cylindrical coordinates. Also, we take on $N_u$
the standard cylindrical metric. For any $\Delta > 0$, let $N_u(\Delta) := [p_u + \Delta, q_u - \Delta] \times S^1$. Suppose
that

$\limsup_{\Delta \to \infty} \left( \limsup_{u \to \infty} \sup_{N_u(\Delta)} |d_{\alpha_u}\phi_u| \right) = \infty$. (9.25)

In this case, we proceed as in §9.2, picking pairs of sequences of points $x^1_u$ and $x^2_u$ at
larger and larger distance from the boundary of $N_u$, such that $s_u := |d_{\alpha_u}\phi_u(x^1_u)|$ goes
to infinity and $x^2_u$ is at distance $s_u^{-1}$ from $x^1_u$. We then add $x^1_u$ and $x^2_u$ to $\mathbf{x}'_u$, go to a
subsequence so that there is a new limit $(C_u, \mathbf{x}'_u) \to (C, \mathbf{x}')$, and repeat the process. We
repeat this, going again to the beginning of §9.3, as many times as possible, and at some
point we must stop.

In case the $j$-th node was not new, that is, $\delta_u = 0$ for big enough $u$, we consider the
normalization of each $C_u$ near the node, and apply the same technique as in the previous
case in a neighborhood of each of the two preimages of the node. The only difference in
this case is that instead of having a finite cylinder $N_u$ we will have a semiinfinite cylinder
$N_u \simeq [\ln\epsilon, \infty) \times S^1$, and we define $N_u(\Delta) := [\ln\epsilon + \Delta, \infty) \times S^1$.

Finally, we apply the same technique around each original marked point,
modelling again a punctured neighborhood of it with a semiinfinite cylinder $[0, \infty) \times S^1$.

9.4. Connecting bubbles appear. Take again some node in $C$ and assume that it is
new. Define the cylinders $N_u$ and $N_u(\Delta)$ as in §9.3. Since we repeated the process in
§9.3 as many times as possible, we must have now

$\limsup_{\Delta \to \infty} \left( \limsup_{u \to \infty} \sup_{N_u(\Delta)} |d_{\alpha_u}\phi_u| \right) < \infty$.

This means that for some $\Delta > 0$ we have $\limsup_{u \to \infty} \sup_{N_u(\Delta)} |d_{\alpha_u}\phi_u| < \infty$. We now
replace $N_u$ by $N_u(\Delta)$ and denote the extremes of the new cylinder $N_u$ again by $p_u$ and
$q_u$.

The neck $N_u \subset C_u$ can either belong to a principal or a bubble component. Suppose
we are in the first case. Then the vortex equation $\iota_{d\mathrm{vol}(\nu_u)} F_A + \mu(\phi) = c$ is satisfied ($\nu_u$
denotes the restriction of the metric on the component of $C_u$ to which $N_u$ belongs). Let
us write $d\mathrm{vol}(\nu_u) = f_u\, dt \wedge d\theta$. We have exponential bounds on each derivative of $f_u$: for
any $l \ge 0$,

$|\nabla^l f_u(z)| \le K_l e^{-d(z, \partial N_u)}$, (9.26)

where $K_l$ is independent of $u$. This follows from the fact that $\nu_u$ is the restriction to $C_u$
of a smooth metric on the universal curve over a moduli space of stable curves, which is
a compact orbifold; hence all its derivatives are bounded. On the other hand, the vortex
equation takes the following form:

$d\alpha_u = f_u (c - \mu(\phi_u))$. (9.27)

Let $\epsilon > 0$ be smaller than the $\epsilon$'s in Theorems 10.3 and 11.1. For any $u$, let $d_u :
[p_u, q_u] \to \mathbb{R}$ be the function defined as

$d_u(t) := \sup_{\{t\} \times S^1} |d_{\alpha_u}\phi_u|$.

We define an $\epsilon$-bubbling list to be a list of sequences $(\{b^1_u\}, \dots, \{b^r_u\})$ satisfying:
(1) each $b^j_u$ belongs to $N_u$,
(2) denoting $b^0_u := p_u$ and $b^{r+1}_u := q_u$ we have, for each $j$ between $0$ and $r$,

$\lim_{u \to \infty} (b^{j+1}_u - b^j_u) = \infty$,

(3) for each $j$ we have $\liminf_{u \to \infty} d_u(b^j_u) > \epsilon$.

Lemma 9.2. The number of sequences in an $\epsilon$-bubbling list is bounded in terms of
$K := \sup_u \mathcal{YMH}_c(\mathcal{C}_u)$.

Proof. For any $b \in [p_u + 1/2, q_u - 1/2]$, let $C(b)$ be the cylinder $[b - 1/2, b + 1/2] \times S^1$. The
lemma follows from this claim: there exists some $\eta > 0$ (independent of $u$) such that,
whenever $d_u(b) > \epsilon$, we have $\|d_{\alpha_u}\phi_u\|_{L^2(C(b))} > \eta$. Indeed, then the length of an $\epsilon$-bubbling
list is at most $\eta^{-1}K$. To prove the claim, fix some $\kappa_0 > 0$. If $\|d_{\alpha_u}\phi_u\|_{L^2(C(b))} < \kappa_0$, then
putting momentarily the connections in Coulomb gauge and using elliptic bootstrapping
alternately with equation (9.27) and $\bar\partial_{\alpha_u}\phi_u = 0$ (taking the bound (9.26) into account)
we deduce a uniform bound on the $L^2_1$ norm of $d\alpha_u$. Since $L^2_1 \subset L^p$ for any $p$, we may
apply Corollary 10.15 to deduce that $\|d_{\alpha_u}\phi_u\|_{L^2(C(b))} > \delta$ for some $\delta > 0$ independent of
$\{\mathcal{C}_u\}$. Finally, we set $\eta$ to be the minimum of $\delta$ and $\kappa_0$. □

Take an $\epsilon$-bubbling list $(\{b^1_u\}, \dots, \{b^r_u\})$ of maximal length. If the list is empty, then
we do nothing, and begin again the process in §9.4 with another node. If there is no
node near which we can construct a nonempty $\epsilon$-bubbling list, then we pass to the next
step in §9.6. If instead the list is nonempty, then we define the points $x^{j,1}_u := (b^j_u, 1) \in N_u$
and $x^{j,2}_u := (b^j_u + 1, 1) \in N_u$. We add to $\mathbf{x}'_u$ these $2r$ new points and do as always: pass
again to a subsequence so that there is a new limiting curve $(C_u, \mathbf{x}'_u) \to (C, \mathbf{x}')$ and
begin again with a node of the new curve $C$. The process has to stop at some moment
because the energy is bounded.

As in §9.3, we do the same for the nodes of C which are not new: take the normal- 
ization and do what we did in the cylinders N u in a neighborhood of each preimage of 
the node, which is conformally equivalent to a semiinfinite cylinder. Finally, we consider 
neighbourhoods of each original marked point and do exactly the same. 

9.5. Vanishing old bubbles. After all this process of adding marked points to the
curves $C_u$ we end up with a limiting curve $(C, \mathbf{x}')$. We denote from now on C := C.
Suppose that $C$ has $k$ nodes (this $k$ will most of the time be bigger than the one in
§9.3), so that for big enough $u$ there is an isomorphism of marked curves

$\xi_u : C(I', \delta_{u,1}, \dots, \delta_{u,k}, \mathbf{x}) \to (C_u, \mathbf{x}_u)$.

We say that a bubble $C_0 \subset C$ is old if for each node $z_j$ contained in $C_0$ the smoothing
parameters $\delta_{u,j}$ vanish. This means that the bubble $C_0$ already existed in the curves $C_u$,
so for any $u$ we have an inclusion $C_0 \subset C_u$. We say that an old bubble $C_0$ vanishes in
the limit if for big enough $u$ we have $d_{A_u}\phi_u|_{C_0} \ne 0$ and

$\lim_{u \to \infty} \|d_{A_u}\phi_u\|_{L^2(C_0)} = 0$.






9.6. Constructing the limiting c-STHM. Take a compact set $K \subset C$ disjoint from
the nodes of $C$ and the original marked points. Since we have the bound

$\sup_u \sup_K |d_{A_u}\phi_u| < \infty$,

standard arguments (for example, Lemma 10.13 combined with a patching argument
as in §4.4.2 in [DK]) imply that there is a subsequence of the sequence $\xi_u^*(P_u, A_u, \phi_u)$
which, after regauging, converges to a limiting triple $(P_K, A_K, \phi_K)$. Taking an exhaustion
of the smooth locus of $C \setminus \mathbf{x}$ by compact sets, we obtain a limiting triple $(P, A, \phi)$ which
satisfies the vortex equations on principal components, and such that $\phi$ is holomorphic
with respect to $A$. Furthermore, $\|d_A\phi\|_{L^2}$ is finite.

The triple $(P, A, \phi)$ will be part of the limiting c-STHM $\mathcal{C}$. Let $\mathbf{z}$ be the set of nodes
of $C$. By construction the bundle $P$ is defined over $C \setminus (\mathbf{x} \cup \mathbf{z})$, and we now prove that
$A$ is meromorphic. On each principal component the pair $(A, \phi)$ satisfies the equations

$\bar\partial_A\phi = 0$ and $\iota_{d\mathrm{vol}(\nu_{[C^{st},\mathbf{x}]})} F_A + \mu(\phi) = c$ (9.28)

(here $C^{st}$ is the stabilization of $C$). The second equation implies that the curvature of $A$
is bounded, and the first one, combined with the fact that the energy $\|d_A\phi\|_{L^2}$ is finite,
allows us to apply Corollary 10.2 and deduce that $\mu(\phi)$ extends continuously to $C$. Going
back now to the second equation again, we deduce that $F_A$ extends continuously to $C$,
so $A$ is meromorphic. On the other hand, the restriction of $F_A$ to each of the bubbles of
$C$ is zero. Finally, the stability condition in the bubbles is satisfied.

Hence, it only remains to construct the limiting gluing data $G$ for $P$ and the collections
of chains of gradient segments $\{T_y\}$, $\{T_x\}$. This will be done in the next three sections.

9.7. Chains of gradient segments and gluing data at the old nodes. Let $y, y'$
be the preimages of an old node in the normalization of $C$. Then $y$ and $y'$ also belong
to each of the normalizations of the curves $C_u$, hence we have chains of segments of
gradient lines $T_{u,y}$ and $T_{u,y'}$. Since the space of chains of segments of gradient lines is compact, we
can assume that there are limit chains $T_y$ and $T_{y'}$. The same happens with gluing data.

On the other hand, each vanishing old bubble $C_0 \subset C$ gives rise to an infinite gradient
segment $T$ in $X$ by Theorem 8.4. If $C_0$ has only two exceptional points (it cannot have
only one, because then it would be a tree bubble), then we collapse $C_0$. This identifies
two different nodes in $C$. Suppose that $T_a$ and $T_b$ are the chains of gradient segments in
each of them (taken with the same orientation). Then the chain of segments in the new
node is the concatenation of $T_a$, $T$ and $T_b$. The gluing data is the obvious one. We leave
the details of this construction to the reader.

If $C_0$ has more than two exceptional points, $y_+, y_-, z_1, \dots, z_r$, and the poles of $A_u$
are in $y_\pm$, then we get as before a limit gradient segment $T$. But instead of collapsing
$C_0$ we substitute it by a chain of $r$ trivial bubbles, each of them containing one of the
points $z_1, \dots, z_r$ and two nodes. And in the node shared by the bubbles containing $z_j$
and $z_{j+1}$ we take the portion of $T$ which lies between the limits $\lim_{u \to \infty} S^1 \cdot \phi_u(z_j)$ and
$\lim_{u \to \infty} S^1 \cdot \phi_u(z_{j+1})$. Again we leave the details to the reader.
9.8. Chains of gradient segments and gluing data at the new nodes. Take some
new node $z \in \mathbf{z}$ with preimages $y, y'$ in the normalisation of $C$. For each $u$ let, as
always, $N_u := N_y(\delta_{u,z}) \subset C(I', \{\delta_{u,z}\}, \mathbf{x}') \simeq C_u$ be the neck stretching to $z$. Assume that
$N_u = [p_u, q_u] \times S^1$ and define $N_u(\Delta) := [p_u + \Delta, q_u - \Delta] \times S^1$. There is some $\Delta_0 > 0$
such that for any $\Delta \ge \Delta_0$ we have

$\limsup_{u \to \infty} \sup_{N_u(\Delta)} |d_{\alpha_u}\phi_u| < \epsilon$, (9.29)

where $\epsilon > 0$ is less than the $\epsilon$'s in Theorems 10.3 and 11.1.

Assume that $y$ is in the side of $\{p_u\} \times S^1$ and $y'$ in the side of $\{q_u\} \times S^1$. By Corollary
10.2 there is a limiting triple $(P_y, A_y, \phi_y)$ defined over $S_y$, and similarly for $y'$. Let
$O \subset X$ be the orbit on which $\phi_y$ takes values, and define $O'$ similarly. It follows from
our construction that

$\lim_{\Delta \to \infty} \left( \limsup_{u \to \infty} d_{S^1}(\phi_u(\{p_u + \Delta\} \times S^1), O) \right) = 0$, (9.30)

and similarly for $O'$.

Pick a trivialization of $P_u$ on $N_u$ with respect to which $d_{A_u} = d + \alpha_u$ is in balanced
temporal gauge with residue $\lambda_u$ (see Section 6.4). We can assume that $|\lambda_u| < 1$ and,
passing to a subsequence, that there is a limit $\lambda_u \to \lambda$. We distinguish two possibilities.

(1) Suppose first that $\lambda$ is not critical, so that both $O$ and $O'$ lie in the fixed point
set. In this case we can take the chains $T_y$ and $T_{y'}$ to be degenerate, and in fact
we have $O = O'$. To see this it suffices to prove that

$\lim_{\Delta \to \infty} \left( \limsup_{u \to \infty} \mathrm{diam}(\phi_u(N_u(\Delta))) \right) = 0$.

Now, this formula follows from applying Theorem 10.3. Indeed, on the one hand
we have for any $z = (t, \theta) \in N_u(\Delta_0)$ a bound

$|d_\alpha\phi(z)| \le K e^{-\sigma d(z, \partial N_u(\Delta_0))} \epsilon$.

Now, if $\Delta > \Delta_0$ and $z \in N_u(\Delta)$ we have, for the same reason,

$|d_\alpha\phi(z)| \le K e^{-\sigma(\Delta - \Delta_0)} e^{-\sigma d(z, \partial N_u(\Delta))} \epsilon$.

In particular, $\|d_\alpha\phi\|_{L^\infty(N_u(\Delta))}$ goes to $0$ as $\Delta \to \infty$. Combined with (10.34), we
deduce that

$\lim_{\Delta \to \infty} \left( \limsup_{u \to \infty} \mathrm{diam}_{S^1}(\phi_u(N_u(\Delta))) \right) = 0$,

which, using (9.30) and the fact that $O$ is a fixed point, implies that the actual
diameter goes to zero:

$\lim_{\Delta \to \infty} \left( \limsup_{u \to \infty} \mathrm{diam}(\phi_u(N_u(\Delta))) \right) = 0$.

(2) Now suppose that $\lambda$ is critical. We have to construct for big enough $u$ a cylinder
$M_u := N_u(\Delta_u)$ such that the conditions given in Section 6.5 (subsection
Convergence near the nodes to chains of gradient segments) are satisfied. Observe
first of all that for any $\Delta > 0$ there is some $u(\Delta)$ such that if $u \ge u(\Delta)$ then

$M^\Delta_u := ([p_u, p_u + \Delta] \cup [q_u - \Delta, q_u]) \times S^1$

is contained in the smooth locus of the limit curve $C$. In particular, $\phi$ restricts
to give a map $\phi : M^\Delta_u \to X$. On the other hand, for any $\epsilon > 0$ there is some
$u(\Delta, \epsilon) \ge u(\Delta)$ such that if $u \ge u(\Delta, \epsilon)$ then

$\sup_{z \in M^\Delta_u} d_{S^1}(\phi(z), \phi_u(z)) < \epsilon$

(because the $\phi_u$ converge modulo gauge to $\phi$). Now take sequences $\Delta_r \to \infty$ and
$\epsilon_r \to 0$, and define for every $r$ the number $u_r := u(\Delta_r, \epsilon_r)$. We can assume
without loss of generality that the sequence $\{u_r\}$ is strictly increasing. Now, take
any $u > 0$ and define $r$ by the condition $u_r \le u < u_{r+1}$. Then define $\Delta_u := \Delta_r$
and also $M_u := N_u(\Delta_u)$. Given $\Delta$ and a big enough $u$, let $M^{\Delta,\pm}_u$ be the two
connected components of $N_u(\Delta) \setminus M_u$. It follows from the construction that

$\lim_{\Delta \to \infty} \left( \limsup_{u \to \infty} \mathrm{diam}_{S^1}(\phi_u(M^{\Delta,\pm}_u)) \right) = 0$.

So it remains to prove that the sequence of cylinders $(\phi_u, M_u)$ converges with
gauge $\lambda$ to a certain chain $T \in \mathcal{T}(X^\lambda)$.

If $u$ is big enough and $\Delta \ge \Delta_0$, Theorem 11.1 gives us a map $\psi_u(\Delta) : T_u(\Delta) :=
[p_u + \Delta, q_u - \Delta] \to X$. Let $l_u := -i(\lambda_u - \lambda)$. Formula (11.63) in Theorem 11.3
(applied first to $T_u(\Delta_0)$ and then to $T_u(\Delta)$) implies that

$|\psi_u(\Delta)'(t) - l_u I(\psi_u(t)) X(\psi_u(t))| \le K e^{-\sigma(\Delta - \Delta_0)} e^{-\sigma d(t, \partial T_u(\Delta))} \epsilon$ (9.31)

for some constant $K$. Let $M_u = T_u \times S^1$. Since $\Delta_u \to \infty$, the previous inequality
allows us to apply Theorem 12.1. Hence, passing to a subsequence, we can
distinguish two possibilities.

• If $l_u|T_u| \to 0$ then $\mathrm{diam}(\psi_u(T_u)) \to 0$, and, passing again to a subsequence,
we may define $T_y$ to be the degenerate chain of gradient segments with
constant value the unique point $x \in X$ such that $\lim d(\psi_u(T_u), x) = 0$.

• If $\lim l_u|T_u| \ne 0$ then for big enough $u$ we have $l_u \ne 0$, and by the theorem
$(l_u^{-1}\psi_u, l_u T_u)$ converges to a chain of gradient segments, which we denote by
$T_y$.

Inequality (11.60) and formula (11.58) in Theorem 11.1 imply that the renormalized
sequence $(l_u^{-1}\phi_u, l_u M_u)$ converges with gauge $\lambda$ to $T_y$.

Finally, to define the gluing data one only needs to pass to a suitable subsequence, as 
is clear from the conditions Convergence of gluing angles and Convergence of gluing data 
in the definition of convergence of sequences of c-STHM (Section 6.5). Indeed, the set 
of possible gluing data in a c-STHM is compact. 
9.9. Chains of gradient segments in generic marked points. Let $\{x_u\}$ be a sequence
of original marked points, where each $x_u \in \mathbf{x}_{ge,u}$. Let $\lambda_u \in i\mathbb{R}$ satisfy $e^{2\pi\lambda_u} =
\mathrm{Hol}(A_u, x_u)$, and let $T_u$ be the chain of gradient segments $T_{x_u}$. Suppose that $\{x_u\}$ converges
to $x \in C$. This means that there is a compact subset $K \subset C \setminus (\mathbf{x} \cup \mathbf{z})$ such
that for big enough $u$ we have $x_u \in \iota_K(K) \subset C_u$. Passing to a subsequence we can also
assume that $\lambda_u \to \lambda \in i\mathbb{R}$. In this section we show that, passing to a subsequence, there
is a well defined limiting chain of gradient segments $T_x$. The analysis is very similar to
the previous one, so we will only sketch it. We distinguish two situations.

(1) Suppose that $\lambda$ is not critical. In this case we define $T_x$ to be the unique degenerate
chain which satisfies the matching condition with $\phi_x$. (Note that for big enough
$u$ the residue $\lambda_u$ is also noncritical, hence $T_u$ is degenerate.)

(2) Suppose that $\lambda$ is critical. In this case, passing to a subsequence, we can assume
that one of the following possibilities holds:

• either $\lambda_u$ is critical for big enough $u$, in which case we define $T_x$ to be
the limit of the sequence of chains $\{T_u\}$,

• or $\lambda_u$ is not critical for big enough $u$, and we follow the same strategy as in
case (2) of the previous section to define $T_x$.

10. Local estimates and cylinders with noncritical residue 

Theorem 10.1. Let $P$ be a principal $S^1$ bundle on the punctured disk $D^*$, and fix an
element $\tau \in T(P, 0)$. Assume that $A$ is a connection on $P$ whose curvature is uniformly
bounded, $\|F_A\|_{L^\infty} < \infty$, and such that $\mathrm{Res}(A, 0, \tau) \notin i\mathbb{Z}$. Let $\phi$ be a section of the trivial
bundle $P \times_{S^1} X$ which satisfies $\bar\partial_A\phi = 0$ and $\|d_A\phi\|_{L^2(D^*)} < \infty$. Then, denoting by $(r, \theta)$
the polar coordinates on $D^*$, there are constants $K > 0$ and $\nu > 0$ such that

\d A <j)(r,e)\<Krr 

The proof of Theorem 10.1 will be given in Section 10.4. 

Corollary 10.2. Under the hypothesis of Theorem 10.1, the section $\phi$ extends at the
origin to give a section $\phi_0$ of $Y_0$. In particular, the composition of $\phi$ with the moment
map, $\mu(\phi)$, extends to a continuous map from the disk $D$ to $i\mathbb{R}$. Furthermore we have:

(1) The section $\phi_0$ is covariantly constant with respect to the limiting connection $A_0$
on $P_0$, that is, $d_{A_0}\phi_0 = 0$.

(2) Let $\lambda := \mathrm{Res}(A, 0, \tau)$. The section $\phi_0$ takes values in $X^\lambda$. In particular, if $\lambda$ is
not critical, then $\phi_0$ takes values in the fixed point set $F$, hence $\phi_0$ is constant
and the following limit exists:

$\phi(0) := \lim_{z \to 0} \phi(z) \in F$. (10.32)

Proof. We prove (1). Take a trivialisation of $P$ around $0$ for which $d_A = d + \alpha + \lambda\,d\theta$,
where $\alpha$ is of type $C^1$ on the whole disk $D$ and is in radial gauge, i.e., $\alpha = a\,d\theta$ for some
function $a$ on $D$ vanishing at the origin. The trivialisation of $P$ induces a trivialisation
of $P_0 \to S_0$, with respect to which $d_{A_0} = d + \lambda\,d\theta$. Using the trivialisation of $P$ we
look at the section $\phi$ as a map $\phi : D^* \to X$. Now define for every $0 < r < 1$ the map
$\phi_r : S^1 \to X$ by $\phi_r(\theta) := \phi(r, \theta)$. It follows from the estimate in Theorem 10.1 that the limit
$\phi_0 := \lim_{r \to 0} \phi_r$ exists and is of type $C^1$. Define now the connection $A_r$ on the trivial
$S^1$ bundle over the circle using the 1-form $\alpha_r(r, \theta) := r a(r, \theta)\,d\theta$. It follows also from
Theorem 10.1 that $d_{A_0}\phi_0 = \lim_{r \to 0} d_{A_r}\phi_r = 0$. Finally, (2) follows from (1), observing
that $\phi_0$ takes values in a unique orbit of the action of $S^1$ on $X$. □

To study the local properties of equation (2.8) we restrict ourselves to considering
trivial principal $S^1$ bundles $P$ over a (not necessarily compact) curve $C$. Then $Y = C \times X$,
the sections of $Y$ can be identified with maps $C \to X$, and the connections on $Y$ are the
same as forms $\alpha \in \Omega^1(C, i\mathbb{R})$. Finally, there is a canonical splitting $TY = TC \oplus TX$ (we
omit the pullbacks). With respect to a given form $\alpha$ the bundle of horizontal tangent
vectors is $T^{hor}_\alpha = \{(u, i\alpha(u)X) \mid u \in TC\} \subset TY$. Consequently, the covariant derivative
of a section $\phi : C \to X$ is $d_\alpha\phi = d\phi - i\alpha X(\phi)$. Finally, if $I$ is an almost complex structure
on $X$, we have

$\bar\partial_{I,\alpha}\phi = \bar\partial_I\phi - \frac{1}{2}\left(i\alpha X(\phi) + i(\alpha \circ I_C)(IX)(\phi)\right)$.

A pair $(\alpha, \phi)$ consisting of a 1-form $\alpha$ on $C$ with values in $i\mathbb{R}$ and a map $\phi : C \to X$
will be simply called a pair; to specify both the curve $C$ and the target manifold $X$ we
will write

$(\alpha, \phi) : C \to X$.

If $I$ is any $S^1$-invariant almost complex structure on $X$, we will say that the pair $(\alpha, \phi)$
is $I$-holomorphic if the equation $\bar\partial_{I,\alpha}\phi = 0$ is satisfied.

For any natural number $N$, denote $C_N := [-N, N] \times S^1$. We can now state the second
main result of this section.

Theorem 10.3. For any noncritical $\lambda \in i\mathbb{R} \setminus \Lambda_{cr}$ there exist some $K > 0$, $\sigma > 0$ and
$\epsilon > 0$, depending continuously on $\lambda$, with the following property. Let $(\alpha, \phi) : C_N \to X$
be an $I$-holomorphic pair satisfying $\|\alpha - \lambda\,d\theta\|_{L^\infty} < \epsilon$ and $\|d_\alpha\phi\|_{L^\infty(C_N)} < \epsilon$. Then the
following inequality holds for any $t, \theta$:

$|d_\alpha\phi(t, \theta)| \le K e^{-\sigma(N - |t|)} \|d_\alpha\phi\|_{L^\infty(C_N)}$. (10.33)

In particular this implies

$\mathrm{diam}_{S^1}(\phi(C_N)) \le K \|d_\alpha\phi\|_{L^\infty(C_N)}$. (10.34)

The proof of Theorem 10.3 will be given in Section 10.5.

10.1. Equivariant charts. Let $I_0$ (resp. $g_0$) denote the standard complex structure (resp. metric) on
$\mathbb{C}^n$, viewed as a differentiable manifold.

Lemma 10.4. There exists a finite set $W = W(X) \subset \mathbb{Z}^n$ and a real number $r =
r(X, \omega, I) > 0$ with the following property. For any $\epsilon > 0$ and any $x \in X$ there exists
an equivariant open neighbourhood $U$ of $x$, an action of $S^1$ on $\mathbb{C}^n$ whose collection of
weights belongs to $W(X)$, an equivariant almost complex structure $I_x$ and metric $g_x$ on
$\mathbb{C}^n$ satisfying $\|I_x - I_0\|_{L^\infty} < \epsilon$ and $\|g_x - g_0\|_{L^\infty} < \epsilon$ (both norms taken with respect to
$g_0$), and an equivariant map $\xi : U \to \mathbb{C}^n$ which is a complex isometry. Furthermore, we
can assume that $\xi(U)$ is contained in the ball $B(0, 2r) \subset \mathbb{C}^n$.

Proof. Take $r$ to be the length of the longest orbit in $X$ divided by $2\pi$. If $x$ is a fixed point
the chart is easily constructed using the exponential map with respect to the invariant
metric $g$. Suppose that $x$ is not a fixed point, and that the length of the orbit through
$x$ is $2\pi\rho$. Let $\Gamma$ be the stabiliser of $x$, which acts linearly on $T_xX$. Let $L \subset T_xX$ be the
complex subspace generated by $X(x)$ and $IX(x)$ (recall that $X \in \Gamma(TX)$ is the vector
field generated by the infinitesimal action of $\mathrm{Lie}\,S^1$). Both $X(x)$ and $IX(x)$ are fixed
by the action of $\Gamma$, so $L$ is $\Gamma$-invariant. Let $N \subset T_xX$ be the Hermitian orthogonal of
$L$, which is a $\Gamma$-invariant and complex vector subspace. Take a complex isomorphism
$\mathbb{C}^{n-1} \simeq N$. Consider the induced action of $\Gamma$ on $\mathbb{C}^{n-1}$. Fix any linear extension $p$ of this
action to $S^1$, which we may assume (choosing appropriately the identification $\mathbb{C}^{n-1} \simeq N$)
to be diagonal and compatible with the standard metric in $\mathbb{C}^{n-1}$. Let $k \in \mathbb{N}$ be the order
of $\Gamma$, and consider the action of $S^1$ on $\mathbb{C}^* \times \mathbb{C}^{n-1}$ given by $\theta \cdot (x, y) := (\theta^k x, p(\theta)(y))$. There
exists an $S^1$-invariant neighbourhood $R \subset \mathbb{C}^*$ of the circle $S_\rho$ of radius $\rho$ centered at the
origin in $\mathbb{C}$ and an $S^1$-equivariant embedding

$f : R \times \mathbb{C}^{n-1} \to X$

(here we view $R \times \mathbb{C}^{n-1}$ as a subset of $\mathbb{C}^* \times \mathbb{C}^{n-1}$) which satisfies, for any $t \in \mathbb{R}$ and
$u \in \mathbb{C}^{n-1}$, $f(e^t, u) := \exp_x(ktIX(x) + u)$. The differential of $f$ at any point in $S^1 \times \{0\}$
is complex. Hence, if $R$ is small enough, there exists an equivariant almost complex
structure $I_x$ on $\mathbb{C}^n$ satisfying $\|I_x - I_0\|_{L^\infty} < \epsilon$ and which induces a complex structure on
$R \times \mathbb{C}^{n-1}$ with respect to which the restriction of $f$ to a neighbourhood $V$ of $R \times \mathbb{C}^{n-1}$ is
complex. Also, since the restriction of $f$ to $S_\rho \times \{0\}$ is an isometry, we can assume that
the pullback metric $f^*g$ extends to $\mathbb{C}^n$ satisfying $\|g - g_0\|_{L^\infty} < \epsilon$. Then we set $U := f(V)$
and $\xi := (f|_V)^{-1}$.

It is easy to see, using the compactness of $X$ and the rigidity of representations of
compact groups, that the set of weights of the representations which we construct as $x$
moves along $X$ forms a finite set. □

10.2. An inequality for the energy on cylinders. Let $C := [-2, 3] \times S^1$ with the
standard product metric. Whenever we write any norm ($L^\infty$, $L^2$, etc.) of either the
connection or the section of a pair defined over $C$, unless we specify some other domain,
we mean the norm over $C$. Define also the following subsets of $C$:

$Z := [-1, 2] \times S^1$, $Z_I := [-1, 0] \times S^1$, $Z_{II} := [0, 1] \times S^1$, $Z_{III} := [1, 2] \times S^1$. (10.35)

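Theorem 10.5. For any noncritical residue $\lambda \in i\mathbb{R} \setminus \Lambda_{cr}$ there exist real numbers
$\epsilon = \epsilon(\lambda, X, I) > 0$, $\gamma = \gamma(\lambda, X, I) \in (0, 1/2)$, and $K = K(\lambda, X, I) > 0$, depending
continuously on $\lambda$, such that if $(\alpha, \phi) : C \to X$ is an $I$-holomorphic pair satisfying the
conditions $\|\alpha - \lambda\,d\theta\|_{L^\infty(C)} < \epsilon$ and $\|d_\alpha\phi\|_{L^2(C)} < \epsilon$, then
(1) the following inequality holds:

$\|d_\alpha\phi\|^2_{L^2(Z_{II})} \le \gamma \left( \|d_\alpha\phi\|^2_{L^2(Z_I)} + \|d_\alpha\phi\|^2_{L^2(Z_{III})} \right)$;

(2) if $I$, $\alpha$ and $\phi$ are of class $C^1$, then $\sup_Z |d_\alpha\phi| \le K\|d_\alpha\phi\|_{L^2(C)}$.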

To prove Theorem 10.5 we need an analogous result for the case $X = \mathbb{C}^n$ which we
now state. Fix a nontrivial diagonal action of $S^1$ on $\mathbb{C}^n$, and denote its weights by
$w = (w_1, \dots, w_n) \in \mathbb{Z}^n$. Consider on $\mathbb{C}^n$ the standard Riemannian metric $g_0$, and denote
by $I_0$ the standard complex structure. Finally, define, for every $\eta \in \mathbb{R}$,

$\gamma(\eta) := \frac{1}{e^{\eta} + e^{-\eta}}$. (10.36)

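Lemma 10.6. There exist real numbers $\epsilon = \epsilon(w) > 0$ and $K = K(w, \lambda)$ with the
following property. Suppose that $I$ is a smooth equivariant almost complex structure in
$\mathbb{C}^n$ such that $\|I - I_0\|_{L^\infty} < \epsilon$, and that $(\alpha, \phi) : C \to \mathbb{C}^n$ is an $I$-holomorphic pair satisfying
$\|\alpha - \lambda\,d\theta\|_{L^\infty(C)} < \epsilon$ and $\|d_\alpha\phi\|_{L^2(C)} < \epsilon$. Then

(1) the following inequality holds:

$\|d_\alpha\phi\|^2_{L^2(Z_{II})} \le \gamma(2 l_{min}) \left( \|d_\alpha\phi\|^2_{L^2(Z_I)} + \|d_\alpha\phi\|^2_{L^2(Z_{III})} \right)$, (10.37)

where $l_{min} := \min\{ |-k + i w_j \lambda| \mid k \in \mathbb{Z},\ 1 \le j \le n \}$;

(2) we can bound:

$\sup_Z |d_\alpha\phi| \le K \|d_\alpha\phi\|_{L^2(C)}$. (10.38)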

Lemma 10.6 will be proved in Section 10.8 below. We now prove Theorem 10.5. Take
$\epsilon$ smaller than the infimum of $\epsilon(w)$ as $w$ moves along $W(X)$. Take, for this choice of
$\epsilon$ and for each $x \in X$, an equivariant chart $\xi_x : U_x \to \mathbb{C}^n$ as in Lemma 10.4. Then
take a finite subcovering of $\{U_x\}_{x \in X}$ and denote it by $\{U_1, \dots, U_r\}$. Denote also by $\xi_j$
and $w_j$ the corresponding embeddings in $\mathbb{C}^n$ and weights. Since the sets $U_j$ are $S^1$-invariant
and open, it follows that there is a constant $\epsilon' > 0$ such that for any $K \subset X$
satisfying $\operatorname{diam}_{S^1} K < \epsilon'$ there is at least one $j$ such that $K \subset U_j$. Now, if we take $\delta > 0$
small enough, it follows from Corollary 10.14 that $\operatorname{diam}_{S^1} \phi(C) < \epsilon'$. Then the statement
of Theorem 10.5 follows from Lemma 10.6 applied to $\xi_j \circ \phi : C \to \mathbb{C}^n$. Indeed, the
condition that $\lambda$ is not critical implies that, applying Lemma 10.6 to any chart, we have
that $l_{\min} > 0$, so $\gamma(2 l_{\min}) < 1/2$. Then $\gamma$ can be defined as the maximum of the numbers
$\gamma(2 l_{\min})$ computed for each of the charts $U_1, \dots, U_r$.
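
The role of the noncriticality condition can be illustrated numerically. The following is only a sketch under stated assumptions: the helper names `gamma`, `l_min` and `contraction_constant` are hypothetical, the real parameter `r` stands for $i\lambda$ (which is real because $\lambda$ is purely imaginary), and zero weights are skipped, which is an assumption of this sketch and not something taken from the text. It computes $l_{\min}$ as a distance to the nearest integer and shows that $\gamma(2 l_{\min}) < 1/2$ exactly when $l_{\min} > 0$.

```python
import math


def gamma(eta):
    """gamma(eta) = 1 / (e^eta + e^-eta), as in (10.36)."""
    return 1.0 / (math.exp(eta) + math.exp(-eta))


def l_min(weights, r):
    """Smallest |-k + r*w_j| over integers k and the nonzero weights w_j.

    r plays the role of i*lambda.  Skipping zero weights is an assumption
    made for this illustration only.
    """
    nonzero = [w for w in weights if w != 0]
    if not nonzero:
        raise ValueError("need at least one nonzero weight")
    # |x - round(x)| is the distance from x to the nearest integer.
    return min(abs(r * w - round(r * w)) for w in nonzero)


def contraction_constant(weights, r):
    """The constant gamma(2 * l_min) appearing in (10.37)."""
    return gamma(2.0 * l_min(weights, r))
```

For instance, with weights `[1, 2]` the value `r = 0.3` gives a constant strictly below 1/2, whereas `r = 0.5` makes `r * 2` an integer, so `l_min` vanishes and the constant degenerates to 1/2.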

10.3. From mean inequalities to exponential decay. The following lemma will
allow us to obtain exponential decay from inequalities of the type given by (1) in Lemma
10.6. We will use it several times in this paper.

Lemma 10.7. Let $\{x_0, \dots, x_N\}$ be a sequence of nonnegative real numbers satisfying,
for some $\gamma \in (0, 1/2)$ and each $k$ between $1$ and $N - 1$,

$$x_k \le \gamma (x_{k+1} + x_{k-1}). \tag{10.39}$$

Let $\xi := (1 + \sqrt{1 - 4\gamma^2})/(2\gamma)$. Then $\xi > 1$ and for any $k$ between $0$ and $N$ we have

$$x_k \le x_0 \xi^{-k} + x_N \xi^{-(N-k)}. \tag{10.40}$$

Proof. That $\xi > 1$ is an easy computation. As for (10.40), observe that $\xi^{-1} = \gamma(1 + \xi^{-2})$.
Consequently, if we define $y_k := x_k - (x_0 \xi^{-k} + x_N \xi^{-(N-k)})$ then $y_k \le \gamma(y_{k+1} + y_{k-1})$.
Since $\gamma \in (0, 1/2)$, it follows that the sequence $\{y_k\}$ attains its maximum at $y_0$ or $y_N$.
Both of these numbers are $\le 0$, so all $y_j$ are nonpositive. This proves (10.40). □
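
As an informal sanity check of Lemma 10.7, the following sketch tests the bound (10.40) on a sample sequence. All function names are hypothetical, and the test sequence $x_k = a e^{-\mu k} + b e^{-\mu(N-k)}$ is an arbitrary choice made for illustration: it satisfies (10.39) with equality for the constant $1/(e^{\mu} + e^{-\mu})$, hence also for any larger $\gamma$.

```python
import math


def xi(gamma):
    """Decay rate of Lemma 10.7: (1 + sqrt(1 - 4*gamma^2)) / (2*gamma)."""
    if not 0.0 < gamma < 0.5:
        raise ValueError("gamma must lie in (0, 1/2)")
    return (1.0 + math.sqrt(1.0 - 4.0 * gamma * gamma)) / (2.0 * gamma)


def satisfies_mean_inequality(xs, gamma, tol=1e-12):
    """Check x_k <= gamma * (x_{k+1} + x_{k-1}) for all interior k."""
    return all(xs[k] <= gamma * (xs[k + 1] + xs[k - 1]) + tol
               for k in range(1, len(xs) - 1))


def satisfies_decay_bound(xs, gamma, tol=1e-12):
    """Check x_k <= x_0 * xi^-k + x_N * xi^-(N-k) for all k."""
    n = len(xs) - 1
    z = xi(gamma)
    return all(xs[k] <= xs[0] * z ** (-k) + xs[n] * z ** (-(n - k)) + tol
               for k in range(n + 1))


def sample_sequence(n, mu, a=1.0, b=0.5):
    """Illustrative sequence decaying from both ends at rate mu."""
    return [a * math.exp(-mu * k) + b * math.exp(-mu * (n - k))
            for k in range(n + 1)]
```

With `gamma = 0.4` one gets `xi(gamma)` equal to 2 up to rounding, and `sample_sequence(10, 1.0)` passes both checks, since its decay rate $\mu = 1$ exceeds $\ln 2$.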

Corollary 10.8. Suppose that $\{x_0, x_1, x_2, \dots\}$ is an infinite sequence of nonnegative real
numbers satisfying $x_k \le \gamma(x_{k+1} + x_{k-1})$ for some $\gamma \in (0, 1/2)$ and any $k \ge 1$. Suppose
that $x_j \to 0$ as $j$ goes to infinity. Define $\xi > 1$ as in Lemma 10.7. Then for any $k$ we
have $x_k \le x_0 \xi^{-k}$.

Proof. Apply Lemma 10.7 to the first $N$ terms and make $N$ go to infinity. □

Lemma 10.9. Suppose that $\{x_{-N}, x_{-N+1}, \dots, x_N\}$ and $\{z_{-N}, z_{-N+1}, \dots, z_N\}$ are
sequences of nonnegative real numbers. Assume that there are positive constants $\gamma \in (0, 1/2)$,
$\chi$, $K$ and $\epsilon$ satisfying:

• for every $j$, $z_j \le K e^{-\chi(N - |j|)}$;

• for every $-N + 2 \le j \le N - 2$, if $z_{j-2} + \dots + z_{j+2} \le \epsilon(x_{j-2} + \dots + x_{j+2})$ then
$x_j \le \gamma(x_{j-1} + x_{j+1})$.

Let $\xi = \xi(\gamma)$ be as in Lemma 10.7, and let $\sigma := \min\{\chi, \ln \xi\}$. Then we have, for any
$-N + 1 \le j \le N - 1$,

$$x_j \le \left( \epsilon^{-1} 10 e^{2\chi} K + x_{-N+1} + x_{N-1} \right) e^{-\sigma(N - |j|)}. \tag{10.41}$$

Proof. For any $-N + 2 \le j \le N - 2$ we say that $x_j$ is big if $x_j > \epsilon^{-1}(z_{j-2} + \dots + z_{j+2})$,
and otherwise we say that $x_j$ is small. We have $z_{j-2} + \dots + z_{j+2} \le 5 e^{2\chi} K e^{-\chi(N - |j|)}$.
In particular, if $x_j$ is small then $x_j \le \epsilon^{-1} 5 e^{2\chi} K e^{-\chi(N - |j|)}$. Take now any $x_j$. If $x_j$ is
small then the previous inequality implies (10.41). So we assume that $x_j$ is big and we
take the longest sequence of consecutive big elements $x_{p+1}, \dots, x_j, \dots, x_{q-1}$ containing
$x_j$. Suppose that $x_p$ and $x_q$ are small (this need not be the case, since we could have
$p = -N + 1$ or $q = N - 1$). By Lemma 10.7 we have $x_j \le x_p e^{-\sigma(j - p)} + x_q e^{-\sigma(q - j)}$. Now,
since both $x_p$ and $x_q$ are small we can bound (using $\sigma \le \chi$)

$$x_j \le \epsilon^{-1} 5 e^{2\chi} K \left( e^{-\sigma(N - |p| + j - p)} + e^{-\sigma(N - |q| + q - j)} \right) \le \epsilon^{-1} 10 e^{2\chi} K e^{-\sigma(N - |j|)},$$

because both $N - |p| + j - p$ and $N - |q| + q - j$ are at least $N - |j|$. When $p = -N + 1$
or $q = N - 1$ we proceed similarly. □

10.4. Proof of Theorem 10.1. By Lemma 3.1 (see Remark 3.4) we can trivialize $P$ in
such a way that $d_A = d + \lambda\,d\theta + \alpha'$, where $\lambda = \operatorname{Res}(A, 0, r)$ and $\alpha'$ is a continuous 1-form
on $D$. Let $\alpha = \lambda\,d\theta + \alpha'$. We distinguish two situations, according to whether the residue
is critical or not.

Suppose first that the residue is critical. In particular, $i\lambda$ is rational, so we can write
$\lambda = ip/q$, where $p, q$ are relatively prime integers and $q \ne 1$. Let $\pi : D^* \to D^*$ be the map
given by $\pi(z) := z^q$. Then $d_{\pi^*A} = d + q\lambda\,d\theta + \pi^*\alpha' = d + p\,d\theta + \pi^*\alpha'$. Let $g : D^* \to S^1$
be defined by $g(r, \theta) := e^{2\pi i p \theta}$. Then

$$d_B := d_{g^*\pi^*A} = d + \pi^*\alpha'$$

is a continuous connection on $D$. Let $I(B)$ be the continuous almost complex structure on
$D \times X$ induced by $B$ (see Section 2). By (2.8), the map $\Phi = (\iota, g \cdot (\phi \circ \pi)) : D^* \to D \times X$
is $I(B)$-holomorphic (here $\iota : D^* \to D$ denotes the inclusion), and by (2.9) $d\Phi$ has
bounded $L^2$ norm. Hence we can apply the theorem on removal of singularities for
holomorphic curves (as proved for continuous almost complex structures in Corollary
3.6 of [IS]), deduce that $\Phi$ extends to an $I(B)$-holomorphic map $\Phi : D \to D \times X$, and
consequently obtain an extension $g \cdot (\phi \circ \pi) : D \to X$. Furthermore, this extension is
of type $C^1$ (because the complex structure is continuous). Let now $\zeta := e^{2\pi i/q}$. Then
$(g \cdot (\phi \circ \pi))(\zeta z) = (g \cdot (\phi \circ \pi))(z)$, so that $d(g \cdot (\phi \circ \pi))(0) = 0$. Furthermore, since $d\pi(0) = 0$,
$\pi^*\alpha'$ vanishes at $0$. It follows from this that $d_{g^*\pi^*A}(g \cdot (\phi \circ \pi)) = (d + \pi^*\alpha')(g \cdot (\phi \circ \pi)) = 0$ at the origin,
from which we deduce that $|d_{g^*\pi^*A}(g \cdot (\phi \circ \pi))(r, \theta)| = |d_{\pi^*\alpha}(\phi \circ \pi)(r, \theta)| \le Kr$ for some
constant K. This then implies that \dA(f>(r, 9)\ < K'r 1 ^ for some other constant K', so 
the claim is proved. 

Now consider the case of noncritical residue $\lambda \notin \Lambda_{cr}$. Consider the cylinder $\mathbb{R}_+ \times S^1$
as a conformal model of $D^*$, with coordinates $t \in \mathbb{R}_+$ and $\theta \in S^1$ and the standard flat
metric $dt^2 + d\theta^2$. We have:

$$|\alpha'(t, \theta)| \le K e^{-t} \quad\text{and}\quad \|d_\alpha\phi\|_{L^2} < \infty. \tag{10.42}$$

We look at $\phi$ as a map from $D^*$ to $X$, so that $(\alpha, \phi)$ is a pair. Define, for any $n \in \mathbb{N}$,
$Z_n = [n, n + 1] \times S^1$ and let $f_n := \|d_\alpha\phi\|^2_{L^2(Z_n)}$. We claim that, for big enough $n$, we have

$$f_{n+1} \le \gamma(\lambda, X)(f_n + f_{n+2}), \tag{10.43}$$

where $\gamma(\lambda, X)$ is as in Theorem 10.5. Indeed, thanks to (10.42), if $n$ is big enough then
both the $L^\infty$ norm of the restriction of $\alpha'$ to $[n, n + 3] \times S^1$ and the $L^2$ norm of $d_\alpha\phi$
restricted to $[n, n + 3] \times S^1$ are less than the value of $\epsilon$ given by Theorem 10.5. Hence,
we can apply (1) in Theorem 10.5 (identifying $Z_I = Z_n$, $Z_{II} = Z_{n+1}$ and $Z_{III} = Z_{n+2}$)
and deduce (10.43). Combining Corollary 10.8 with (10.43) we deduce that

$$\|d_\alpha\phi\|^2_{L^2(Z_n)} = f_n \le K \xi^{-n},$$

where $\xi > 1$. Now, using (2) in Theorem 10.5 we deduce the pointwise bound

K0M)|<x'r' 

for some other constant K'. This finishes the proof of the case of noncritical residue. 

10.5. Proof of Theorem 10.3. The proof is exactly like that of the case of noncritical
residue of Theorem 10.1, in Section 10.4 above. Namely, we define $f_n$ to be
$\|d_\alpha\phi\|^2_{L^2([n, n+1] \times S^1)}$ and we use Theorem 10.5 (provided $\epsilon$ has been chosen small enough)
to deduce that $f_{n+1} \le \gamma(f_n + f_{n+2})$ for some $\gamma \in (0, 1/2)$ depending on $\lambda$. Then Lemma
10.7 gives $f_n \le K e^{-\sigma(N - |n|)}(f_{-N} + f_{N-1})$ for some $K$ and $\sigma$. To deduce the pointwise
bound (10.33) we use (2) in Theorem 10.5. Finally, (10.34) follows easily from (10.33)
combined with Lemma 2.1.

10.6. Some convexity properties of holomorphic maps. In this subsection we state
a result which will be crucial for proving most of the convexity results in this paper. Part
of it can be seen as a particular case of Lemma 10.6 (concretely, the case $I = I_0$ and
$\alpha' = 0$).

For any $\delta \in (0, 1/2)$ we will denote

$$Z^\delta := [-1 + \delta, 2 - \delta] \times S^1, \quad Z_I^\delta := Z_I \cap Z^\delta, \quad Z_{III}^\delta := Z_{III} \cap Z^\delta. \tag{10.44}$$

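Lemma 10.10. Take on $\mathbb{C}^n$ the standard almost complex structure $I_0$ and metric $g_0$,
and suppose that $S^1$ acts diagonally on $\mathbb{C}^n$ with weights $w_1, \dots, w_n$. Let $\lambda \in i\mathbb{R}$ and let
$\phi : C \to \mathbb{C}^n$ be a smooth map satisfying $\bar\partial_{\lambda d\theta}\phi = 0$. Let

$$l_{\min} := \min\{ |-k + i\lambda w_j| \;:\; k \in \mathbb{Z},\ 1 \le j \le n \}.$$

Let $\gamma$ be the function defined in (10.36).

(1) If $l_{\min} \ne 0$ then we have, for some $\delta > 0$ depending on $l_{\min}$ but independent of $\phi$:

$$\|d_{\lambda d\theta}\phi\|^2_{L^2(Z_{II})} \le \gamma(2 l_{\min}) \left( \|d_{\lambda d\theta}\phi\|^2_{L^2(Z_I^\delta)} + \|d_{\lambda d\theta}\phi\|^2_{L^2(Z_{III}^\delta)} \right).$$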

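(2) Suppose that $\lambda = 0$ (so that $\phi$ is holomorphic) and define

$$\phi_{av}(t, \theta) := \phi(t, \theta) - \frac{1}{2\pi} \int \phi(t, \nu)\,d\nu.$$

(Here the subscript av stands for average.) Then, if $\phi$ is not constant, the following
holds:

$$\|\phi_{av}\|^2_{L^2(Z_{II})} \le \frac{1}{e^{2} + e^{-2}} \left( \|\phi_{av}\|^2_{L^2(Z_I)} + \|\phi_{av}\|^2_{L^2(Z_{III})} \right).$$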



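Proof. The condition $\bar\partial_{\lambda d\theta}\phi = 0$ is equivalent to

$$\frac{\partial\phi}{\partial t} = i \left( \frac{\partial\phi}{\partial\theta} - i\lambda \mathcal{X}(\phi) \right), \tag{10.45}$$

where, denoting by $(\phi_1, \dots, \phi_n)$ the coordinates of $\phi$, we have $\mathcal{X}(\phi) = (i w_1 \phi_1, \dots, i w_n \phi_n)$.
Writing for each $j$ the Fourier expansion $\phi_j(t, \theta) = \sum_{k \in \mathbb{Z}} a_{k,j}(t) e^{ik\theta}$, formula (10.45) is
equivalent to the equation $a'_{k,j} = (-k + i\lambda w_j) a_{k,j}$ for each Fourier coefficient. Hence
$a_{k,j}(t) = a_{k,j}(0) e^{(-k + i\lambda w_j)t}$. Also, since $\bar\partial_{\lambda d\theta}\phi = 0$ we have $|d_{\lambda d\theta}\phi|^2 = 2|\partial\phi/\partial t|^2$. Putting
everything together we write

$$\int_{[t_0, t_1] \times S^1} |d_{\lambda d\theta}\phi|^2 = \sum_{j=1}^{n} \sum_{\substack{k \in \mathbb{Z} \\ ik + \lambda w_j \ne 0}} \int_{t_0}^{t_1} 2 |a_{k,j}(0)|^2\, |ik + \lambda w_j|^2\, e^{2(-k + i\lambda w_j)t}\,dt. \tag{10.46}$$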

The proof now follows from Lemma 10.11 below. This proves (1). The formula in 
(2) is proved similarly, noting that the integral which computes ||0av|| 2 is of the form 

/ 2~2jez\{o} \ a k,j\ 2 e 2jt dt. □ 
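
The function $\gamma$ enters through a single exponential mode. As a hedged illustration (it covers only the limiting case of the full intervals $[-1,0]$, $[0,1]$, $[1,2]$, not the shortened intervals of the lemma, and the helper names are hypothetical), the sketch below checks with closed-form integrals that for $u(t) = e^{\eta t}$ the integral over the middle interval equals $\gamma(\eta)$ times the sum of the integrals over the two outer intervals.

```python
import math


def gamma(eta):
    """gamma(eta) = 1 / (e^eta + e^-eta), as in (10.36)."""
    return 1.0 / (math.exp(eta) + math.exp(-eta))


def mode_energy(eta, a, b):
    """Closed-form integral of exp(eta * t) over the interval [a, b]."""
    if eta == 0.0:
        return b - a
    return (math.exp(eta * b) - math.exp(eta * a)) / eta


def convexity_ratio(eta):
    """Middle integral divided by the sum of the two outer integrals."""
    middle = mode_energy(eta, 0.0, 1.0)
    sides = mode_energy(eta, -1.0, 0.0) + mode_energy(eta, 1.0, 2.0)
    return middle / sides
```

The identity holds because shifting the interval by $\pm 1$ multiplies the integral by $e^{\pm\eta}$. In particular the ratio is strictly less than 1/2 as soon as $\eta \ne 0$, which is where the condition $l_{\min} \ne 0$ is used.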
Lemma 10.11. For any $\eta_0 > 0$ there is some $\delta > 0$ such that for every $\eta \in \mathbb{R}$ satisfying
$|\eta| \ge \eta_0$ we have



Proof. By symmetry it suffices to consider the case $\eta > 0$. A simple computation shows
that the inequality



is equivalent to $\beta > 1/(e^{\eta} + e^{-\eta})$. Since the function $f(x) := e^x + e^{-x}$ is increasing for
positive $x$, if we set $\beta := \gamma(\eta_0/2)$ then (10.48) holds for every $\eta \ge \eta_0$. On the other hand,
if we set $\delta := 1/2$ then (10.47) holds for every $\eta$ bigger than some $\eta' > \eta_0 > 0$. Indeed,
the left hand side in (10.47) grows as a function of $\eta$ as $e^{\eta}$, whereas the right hand side
grows as $e^{\eta(2 - \delta)} = e^{3\eta/2}$. On the other hand, since the inequality (10.48) is strict for
$\beta = \gamma(\eta_0)$ and every $\eta \ge \eta_0$, we can pick some very small $0 < \delta < 1/2$ such that (10.47)
holds for every $\eta \in [\eta_0, \eta']$. Then (10.47) will also hold for every $\eta \ge \eta_0$ (of course, if for
a given $\eta$ (10.47) is true for some $\delta = \delta_0$, then it is also true for any $0 < \delta < \delta_0$). □

10.7. Compactness of the set of pairs of small energy. A sequence of pairs $\{(\alpha_n, \phi_n) : C \to X\}$
will be said to converge to a pair $(\alpha, \phi) : C \to X$ if $\alpha_n \to \alpha$ in the $L^\infty$ norm
and $\phi_n \to \phi$ in $L^p_{1,loc}$ for any $p < \infty$. This implies that for any compact subset $K \subset C$
the energies $\|d_{\alpha_n}\phi_n\|_{L^2(K)}$ converge to $\|d_\alpha\phi\|_{L^2(K)}$, and that the maps $\phi_n$ converge to
$\phi$ in the continuous topology.

The following lemma is, except for the last statement, a combination of Lemma 3.1 
and Corollary 3.3 in [IS] (see Definition 3.1 in [IS] for the notion of uniformly continuous 
almost complex structure on a manifold). 

Lemma 10.12. Let $Y$ be a (not necessarily compact) manifold. Let $h$ be some metric
on $Y$, and let $I$ be a continuous almost complex structure on $Y$. Let $Z \subset Y$ be a
closed $h$-complete subset, such that $I$ is uniformly continuous on $Z$ w.r.t. $h$. There
exists a real number $\epsilon = \epsilon(I, Z, h)$ with the following property. Let $\{I_n\}$ be a sequence
of continuous almost complex structures on $Y$ such that $I_n \to I$ in the $C^0$-topology of $Y$.
Let $u_n \in C^0 \cap L^2_{1,loc}(D, Y)$ be a sequence of $I_n$-holomorphic maps such that $u_n(D) \subset Z$,
$\|du_n\|_{L^2(D)} < \epsilon$ and $u_n(0)$ is bounded in $X$. Then there exists a subsequence $\{u_{n_k}\}$ which
$L^p_{1,loc}$ converges to an $I$-holomorphic map $u_\infty$ for all $p < \infty$. In particular, for any $K \Subset D$
the norms $\|du_{n_k}\|_{L^2(K)}$ tend to $\|du_\infty\|_{L^2(K)}$. Furthermore, if for some $p > 2$ and $k \ge 1$
the $L^p_k$ norms of $I_n$ are uniformly bounded and $\|I_n - I\|_{L^p_k} \to 0$, then $u_\infty$ is of class $L^p_k$
and the subsequence can be chosen to converge to $u_\infty$ in $L^p_k$.

Proof. We prove the last statement: by Proposition B.4.7 in [McDS], each $u_n$ is of class
$L^p_{k+1}$, and since $L^p_{k+1} \to L^p_k$ is compact, passing to a subsequence there is a limit $u_n \to u_\infty$
in $L^p_k$. □

Using the previous lemma we prove the following result on convergence of pairs with
small energy.

Lemma 10.13. Let $\{I_n\}$ be a sequence of continuous almost complex structures on $X$
such that $I_n \to I$ in the $L^\infty$ norm, where $I$ is also a continuous almost complex structure.
Let $C$ be a (not necessarily compact) complex curve with a Riemannian metric. Let
$(\alpha_n, \phi_n) : C \to X$ be a sequence of $I_n$-holomorphic pairs. Assume that $\|d\alpha_n\|_{L^\infty}$ is
uniformly bounded, and that $\|d_{\alpha_n}\phi_n\|_{L^2(C)} < \epsilon/2$ (where $\epsilon$ is as in Lemma 10.12). Then,
for any compact subset $M \subset C$, there exists a subsequence of pairs $\{(\alpha_{n_k}, \phi_{n_k})\}$ whose
restriction to $M$ converges, up to regauging, to an $I$-holomorphic pair $(\alpha, \phi) : M \to X$. If
for some $p > 2$ and $k \ge 1$ the $L^p_k$ norms of $\alpha_n$, $I_n$, $\alpha$ and $I$ are uniformly bounded, then
$\phi$ is of class $L^p_k$ and the subsequence can be taken so that $\phi_n \to \phi$ in $L^p_k$.

Proof. Since $d\alpha_n$ has uniformly bounded $L^\infty$ norm we can assume, up to regauging,
that the sequence $\alpha_n$ is uniformly continuous and $L^\infty$ bounded. By Ascoli-Arzelà it
follows that there is a subsequence which converges in $L^\infty$ norm to a continuous 1-form
$\alpha \in \Omega^1(D, i\mathbb{R})$. Take the corresponding sequence of pairs and denote it again
by $\{(\alpha_n, \phi_n)\}$, so that $\alpha_n$ converges to $\alpha$ in the $L^\infty$ norm as $n \to \infty$. By a standard
patching argument in gauge theory (see §4.4.2 in [DK]) it suffices to prove the result of
the lemma for a finite collection of disks covering C. Let h be the product metric on 
Y = C x X, let $ n : C — > Y be the section corresponding to <f> n , and let h n := g(a n ) 
and h := g(a). Then we have (see (2.9)) = 1 + \d an (f> n \ 2 . Since a n — * a in L°° 

we deduce that for a big enough n we can bound h n < \^2h. It follows that, if D d D 
is a disk of area e 2 /(\^2K), where if is a suitable constant depending on the metrics h 
and h, then ||ci$n|U,L 2 (D) < e - Hence, if / : D — > D is an affine biholomorphism, then 
||d($ n o f)\\h,L 2 (v>) < Finally, it follows from a n — > a that I n (a n ) — > 1(a) in L°° norm. 
Taking into account (2.8), we can apply Lemma 10.12 (it is clear that Z := D x X 
satisfies the hypothesis of Lemma 10.12) to the sequence of maps {$ n o /}, and deduce 
that a subsequence converges to a map defined on D. The last statement follows from 
the last statement in Lemma 10.12 (since p > 2 and k > 1, if a n has bounded L p k norm 
then I(a n ) is also bounded in L v k ). □ 

In the following two corollaries we use the same notation as in Section 10.2, so C 
denotes the cylinder [—1, 2] x S 1 and Z = [0, 1] x S 1 . 

Corollary 10.14. Let $I$ be an almost complex structure on $X$. For any $\epsilon > 0$ there exists
some $\delta > 0$ such that if $(\alpha, \phi) : C \to X$ is an $I$-holomorphic pair satisfying $\|d\alpha\|_{L^\infty(C)} < \delta$
and $\|d_\alpha\phi\|_{L^2(C)} < \delta$ then

$$\operatorname{diam}_{S^1} \phi(Z) < \epsilon.$$

Proof. We prove the corollary by contradiction. Suppose that for some $\epsilon > 0$ no such
$\delta$ exists. Then there is a sequence of $I$-holomorphic pairs $(\alpha_n, \phi_n) : C \to X$ such that
$\|d\alpha_n\|_{L^\infty(C)} \to 0$, $\|d_{\alpha_n}\phi_n\|_{L^2(C)} \to 0$ and $\operatorname{diam}_{S^1} \phi_n(Z) \ge \epsilon$. By Lemma 10.13 we can
assume, up to restricting to a subsequence and regauging, that there is an $I$-holomorphic
pair $(\alpha, \phi) : Z \to X$ such that the restriction of $\{(\alpha_n, \phi_n)\}$ to $Z$ converges to $(\alpha, \phi)$.
This implies in particular that $\|d_\alpha\phi\|_{L^2(Z)} = 0$ and that $\operatorname{diam}_{S^1} \phi(Z) \ge \epsilon$, which is
impossible. □

Corollary 10.15. For any $p > 2$, $K$ and $\epsilon > 0$ there is some $\delta > 0$ with the following
property. Suppose that $(\alpha, \phi) : Z \to X$ is an $I$-holomorphic pair, that $\|d\alpha\|_{L^p} \le K$, and
that $\sup_{\{1/2\} \times S^1} |d_\alpha\phi| \ge \epsilon$. Then $\|d_\alpha\phi\|_{L^2(Z)} \ge \delta$.

Proof. Fix $p$, $K$ and $\epsilon$, and suppose there is no $\delta > 0$ satisfying the conclusion of the
Corollary. Then there is a sequence of pairs $(\alpha_u, \phi_u) : Z \to X$ such that $\|d\alpha_u\|_{L^p} \le K$,
$\sup_{\{1/2\} \times S^1} |d_{\alpha_u}\phi_u| \ge \epsilon$ and $\|d_{\alpha_u}\phi_u\|_{L^2(Z)} \to 0$. Using the last statement in Lemma
10.13, we deduce that there is a subsequence (which we denote again by $(\alpha_u, \phi_u)$) which
converges to $(\alpha, \phi)$ in $L^p_1$. In particular, $d_\alpha\phi = 0$, but also $d_{\alpha_u}\phi_u$ converges pointwise to
$d_\alpha\phi$, which is a contradiction. □

10.8. Proof of Lemma 10.6. The strategy will be to reduce the lemma to the case
of the standard complex structure $I_0$ on $\mathbb{C}^n$ and $\alpha = \lambda\,d\theta$. We will do this by means
of a compactness argument. Before stating the argument, we need some preliminaries.
Permuting the coordinates if necessary, we can assume that for some $p$ we have

$$\text{for any } 1 \le j \le p,\ w_j \ne 0 \quad\text{and}\quad w_{p+1} = \dots = w_n = 0. \tag{10.49}$$

Let $q = n - p$, so that we have a splitting $\mathbb{C}^n = \mathbb{C}^p \times \mathbb{C}^q$. Let $\pi_p$, $\pi_q$ be the projections
from $\mathbb{C}^n$ to $\mathbb{C}^p$ and $\mathbb{C}^q$. The following lemma will be proved in Section 10.9.

Lemma 10.16. There exists some constant $K > 0$ with the following property. Let
$(\alpha, \phi) : Z \to \mathbb{C}^n$ be an $I$-holomorphic pair such that $\|\alpha - \lambda\,d\theta\|_{L^\infty} < |\lambda|/2$. For any
$0 < \delta < 1/2$ there exists some $y \in \mathbb{C}^q$ such that

$$\sup_{z \in Z^\delta} |\phi(z) - (0, y)| \le l_{\min}^{-1} \delta^{-1/2} K \left( \|d_\alpha\phi\|_{L^2} + 1 \right).$$

We first prove (10.37). Suppose that no matter how small $\epsilon$ is there are pairs for which
(10.37) does not hold. Then one can choose a sequence of equivariant almost complex
structures $I_u$ and $I_u$-holomorphic pairs $(\alpha_u, \phi_u) : C \to \mathbb{C}^n$ satisfying:

$$\|I_u - I_0\|_{L^\infty} \to 0, \quad \|\alpha_u - \lambda\,d\theta\|_{L^\infty} \to 0, \quad \|d_{\alpha_u}\phi_u\|_{L^2(C)} \to 0,$$

and for which the inequality (10.37) holds in the opposite direction:

$$\|d_{\alpha_u}\phi_u\|^2_{L^2(Z_{II})} > \gamma(2 l_{\min}) \left( \|d_{\alpha_u}\phi_u\|^2_{L^2(Z_I)} + \|d_{\alpha_u}\phi_u\|^2_{L^2(Z_{III})} \right). \tag{10.50}$$

In particular, the energy $E_u$ of $(\alpha_u, \phi_u)$ is nonzero. Let $\epsilon'$ be less than the epsilons
appearing in Lemmas 10.16 and 10.13, and define for every $u$ a new pair $(\alpha'_u, \phi'_u)$ by
setting $\alpha'_u := \alpha_u$ and $\phi'_u(z) := \phi_u(z)\,\epsilon'/(2E_u)$. Let also $I'_u$ be the pullback of $I_u$ by the
homothety of ratio $2E_u/\epsilon'$. Then $(\alpha'_u, \phi'_u)$ is an $I'_u$-holomorphic pair and its energy is $\epsilon'/2$.
Furthermore, since $I_0$ is invariant under homotheties, it follows that $\|I'_u - I_0\|_{L^\infty} \to 0$.

Let $\delta > 0$ be the number given by (1) in Lemma 10.10 for our value of $l_{\min}$. Let
$y_u \in \mathbb{C}^q$ be a point, as given by Lemma 10.16, such that $\phi'_u(Z^\delta)$ is contained in the ball
centered at $(0, y_u)$ and of radius $(\epsilon' + 1) K l_{\min}^{-1} \delta^{-1/2}$. Finally, let $\phi''_u := \phi'_u - (0, y_u)$ and let $I''_u$ be the
pullback of $I'_u$ by the translation in the direction $(0, y_u)$. It follows that $(\alpha'_u, \phi''_u)$ is
$I''_u$-holomorphic (note that $I''_u$ is $S^1$-equivariant) and $\phi''_u(Z^\delta)$ is contained in the ball centered
at $0$ of radius $(\epsilon' + 1) K l_{\min}^{-1} \delta^{-1/2}$. At this point we use Lemma 10.13 to deduce that, up
to regauging, there is a subsequence of $\{(\alpha'_u, \phi''_u)\}$ which converges to an $I_0$-holomorphic
pair $(\lambda\,d\theta, \phi) : Z^\delta \to \mathbb{C}^n$ which has energy $\epsilon'/2$ and such that

$$\|d_{\lambda d\theta}\phi\|^2_{L^2(Z_{II})} \ge \gamma(2 l_{\min}) \left( \|d_{\lambda d\theta}\phi\|^2_{L^2(Z_I^\delta)} + \|d_{\lambda d\theta}\phi\|^2_{L^2(Z_{III}^\delta)} \right). \tag{10.51}$$

Now, comparing this with (1) in Lemma 10.10 we obtain a contradiction. This finishes
the proof of (10.37).

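The inequality (10.38) can be reduced, following the same strategy, to the case of the
standard complex structure $I_0$ on $\mathbb{C}^n$ and the connection $\alpha = \lambda\,d\theta$. Suppose for simplicity
that $n = 1$ and that $w_1 = 1$ (the general case offers no extra difficulty). Let $\phi : C \to \mathbb{C}$
be a map which satisfies

$$\frac{\partial\phi}{\partial t} = i \left( \frac{\partial\phi}{\partial\theta} + \lambda\phi \right). \tag{10.52}$$

We want to prove that

$$\sup_Z \left| \frac{\partial\phi}{\partial t} \right| \le K \left\| \frac{\partial\phi}{\partial t} \right\|_{L^2(C)}$$

for some constant $K$. Let $\psi := \partial\phi/\partial t$. Then $\psi$ satisfies the same equation (10.52) as $\phi$.
It follows from this that the function $\zeta = e^{-i\lambda t}\psi$ is holomorphic on $C$. Define the constants
$K_0 := \sup_{t \in [0,1]} e^{i\lambda t}$ and $K_1 := \sup_{t \in [0,1]} e^{-i\lambda t}$. Take any $z \in Z$ and denote by $D \subset C$ the
disk of radius $1$ centered at $z$. Since $\zeta$ is holomorphic we have $|\zeta(z)| \le \|\zeta\|_{L^2(D)}$. Now
we bound:

$$|\psi(z)| \le K_0 |\zeta(z)| \le K_0 \|\zeta\|_{L^2(D)} \le K_0 K_1 \|\psi\|_{L^2(D)} \le K_0 K_1 \|\psi\|_{L^2(C)},$$

which is what we wanted to prove.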
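
The mean value step used in this argument, namely that a holomorphic function is bounded at the center of a unit disk by its $L^2$ norm over the disk, can be checked explicitly on polynomials. The sketch below is an illustration only: the function names are hypothetical, and it treats a polynomial $p(w) = \sum_k c_k w^k$ in the coordinate $w$ centered at the point in question. It uses the orthogonality of monomials on the unit disk, $\int_D |w|^{2k} = \pi/(k+1)$, which gives $\|p\|^2_{L^2(D)} = \pi \sum_k |c_k|^2/(k+1) \ge \pi |p(0)|^2$.

```python
import math


def l2_norm_on_unit_disk(coeffs):
    """L^2 norm over the unit disk of p(w) = sum_k coeffs[k] * w**k.

    Monomials are orthogonal on the disk and the integral of |w|^(2k)
    over the unit disk equals pi / (k + 1).
    """
    total = sum(abs(c) ** 2 * math.pi / (k + 1) for k, c in enumerate(coeffs))
    return math.sqrt(total)


def center_value(coeffs):
    """|p(0)|, the modulus of the constant coefficient."""
    return abs(coeffs[0]) if coeffs else 0.0


def mean_value_bound_holds(coeffs, tol=1e-12):
    """Check |p(0)| <= ||p||_{L^2(D)} / sqrt(pi), hence <= ||p||_{L^2(D)}."""
    bound = l2_norm_on_unit_disk(coeffs) / math.sqrt(math.pi)
    return center_value(coeffs) <= bound + tol
```

Equality in the sharper bound with the factor $1/\sqrt{\pi}$ occurs exactly for constant polynomials, so the inequality without that factor, as used in the text, always has room to spare.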



10.9. Proof of Lemma 10.16. First of all we state and prove two lemmas which will
be necessary for proving Lemma 10.16. For the first one we introduce the following
notation. Let $M$ be the manifold $\mathbb{R} \times S^1 \times \mathbb{C}^n$. Take a 1-form $\alpha$ on $\mathbb{R} \times S^1$ with values
in $i\mathbb{R}$, and let $J := I(\alpha)$ be the complex structure on $M$ induced by $\alpha$ and the action of
$S^1$ on $\mathbb{C}^n$ with weights $w_1, \dots, w_n$ as in (10.49). Consider also the metric $h := g(\alpha)$ on
$M$ induced by $\alpha$ and the standard metric $g_0$ on $\mathbb{C}^n$.

Lemma 10.17. For any $K' > 0$ there exists some $K'' > 0$ with the following property.
Let $\alpha \in \Omega^1(\mathbb{R} \times S^1, i\mathbb{R})$ be a connection 1-form. Suppose that $\|\alpha\|_{L^\infty} \le K'$ and that
$\Phi = (\iota, \phi) : Z \to M$ is a $J$-holomorphic map, where $\iota : Z \to \mathbb{R} \times S^1$ is the inclusion;
suppose also that for some $z \in Z$ and $R > 0$ we have $d(\phi(z), \phi(\partial Z)) \ge R$. Then

$$\|d\Phi\|^2_{L^2} = \operatorname{area}(Z) + \|d_\alpha\phi\|^2_{L^2} \ge K'' R^2.$$

Proof. This is similar to a standard result in the theory of pseudo-holomorphic curves.
First, since $\Phi$ is $J$-holomorphic, $\|d\Phi\|^2_{L^2}$ is equal to the area of $\Phi(Z)$, so we have to prove
that $\operatorname{area}(\Phi(Z)) \ge K'' R^2$. Define, for any $R \ge r > 0$, the following sets:

$$M_r = \{ (t, \theta, x) \in M \;:\; |x - \phi(z)| < r \}, \quad B_r = \partial M_r, \quad Z_r = \Phi^{-1}(M_r).$$

Define also $f(r) := \operatorname{area}(\Phi(Z_r))$. A real number $r$ will be called regular if it is a regular
value of the function $Z \ni y \mapsto |\phi(y) - \phi(z)|$. For any such $r$, $\Phi(Z_r) \subset M$ is a smooth
subsurface with boundary. Also, the set of nonregular values has measure zero.

We first prove an upper bound for $f(r)$. There exists some constant $K_1 > 0$ such that
for any regular $r$ there is a smooth subsurface $\Sigma \subset M$ such that $\partial\Sigma = \partial\Phi(Z_r)$ and such
that $\operatorname{area}(\Sigma) \le K_1 \operatorname{length}(\partial\Phi(Z_r))^2$. Let $\omega(\alpha)$ be the minimal coupling symplectic form on
$M$ (see Section 2). Since $H^2(M, \mathbb{R}) = 0$ and since $\omega(\alpha)$ has bounded $L^\infty$ norm (because
$\alpha$ is bounded), it follows that, for some constant $K_2$ and any regular $r$,

$$\int_{\Phi(Z_r)} \omega(\alpha) = -\int_{\Sigma} \omega(\alpha) \le K_2 \operatorname{area}(\Sigma) \le K_2 K_1 \operatorname{length}(\partial\Phi(Z_r))^2.$$

On the other hand, since $\Phi$ is $I(\alpha)$-holomorphic we can identify the first integral with
the area of $\Phi(Z_r)$. Hence we have proved

$$f(r) \le K_2 K_1 \operatorname{length}(\partial\Phi(Z_r))^2. \tag{10.53}$$

To finish the proof of the lemma, observe that there exists some constant $K_3$ such that
for any regular $r$ we have $f'(r) \ge K_3 \operatorname{length}(\partial\Phi(Z_r))$. Combining this with (10.53) we
deduce $f'(r)^2 \ge K_3^2 f(r)/(K_2 K_1)$. Hence $(\sqrt{f})' = f'/(2\sqrt{f}) \ge K_3/(2\sqrt{K_2 K_1})$, and
integrating over $r \in (0, R)$ we deduce the result. □

Let $\mathcal{X} : \mathbb{C}^n \to \mathbb{C}^n$ be the vector field generated by the infinitesimal action of $S^1$ on $\mathbb{C}^n$.

Lemma 10.18. There exists some constant $K_1$, independent of $\lambda$, such that if $f : S^1 \to \mathbb{C}^p \subset \mathbb{C}^n$
is any smooth map, and $\beta : S^1 \to i\mathbb{R}$ satisfies $|\beta - \lambda| < |\lambda|/2$ then

$$\sup |f| \le l_{\min}^{-1} K_1 \| f' + \beta\mathcal{X}(f) \|_{L^2(S^1)}.$$

Proof. Let $\beta_0 := \frac{1}{2\pi} \int \beta(\theta)\,d\theta$. Define $g := e^{s} f$, where $s : S^1 \to i\mathbb{R}$ satisfies $ds = \beta - \beta_0$.
We have $|f| = |g|$, and one checks easily that $|f' + \beta\mathcal{X}(f)| = |g' + \beta_0\mathcal{X}(g)|$ pointwise.
Hence it suffices to prove that for some universal constant $K_1$ (depending on $\lambda$) and any
constant $\beta_0$ satisfying $|\beta_0 - \lambda| < |\lambda|$ one has the inequality

$$\sup |g| \le l_{\min}^{-1} K_1 \| g' + \beta_0\mathcal{X}(g) \|_{L^2(S^1)}.$$

Now, this can be proved by considering the Fourier series of $g$. □



After these preliminaries, we now prove Lemma 10.16. Pick $K_1$ satisfying the requirement
of the previous lemma and in such a way that for any $h : S^1 \to \mathbb{C}^q \subset \mathbb{C}^n$ and any
$\theta, \eta \in S^1$ we have

$$|h(\theta) - h(\eta)| \le K_1 \|h'\|_{L^2(S^1)} \tag{10.54}$$

(this is possible in view of Cauchy-Schwarz). Let $K''$ be the constant given by Lemma
10.17 for the value $K' = 3|\lambda|/2$. Take $K > 0$ big enough so that for any positive $E > 0$
we have

$$l_{\min}^{-1} \delta^{-1/2} K (E + 1) \ge 6 \max\left\{ l_{\min}^{-1} \delta^{-1/2} K_1 E,\ \sqrt{(E^2 + 2\pi)/K''} \right\}. \tag{10.55}$$

There exists some $a \in [0, \delta]$ such that

$$\|d_\alpha\phi\|^2_{L^2(\{a\} \times S^1)} \le \delta^{-1} \|d_\alpha\phi\|^2_{L^2}.$$

Denote the components of $\phi(a, \theta)$ by $(f(\theta), h(\theta))$. Let $\beta : S^1 \to i\mathbb{R}$ be the map such that
for any $\theta$ we have $\alpha(a, \theta) = \beta(\theta)\,d\theta + \gamma(\theta)\,dt$. Applying Lemma 10.18 to the component
$f$ and inequality (10.54) to the component $h$ (together with $l_{\min} \le 1$) we deduce that

$$\sup_{\theta \in S^1} |\phi(a, \theta) - (0, y_a)| \le \rho := l_{\min}^{-1} \delta^{-1/2} K_1 \|d_\alpha\phi\|_{L^2},$$

where $y_a = h(1)$ (indeed, $f' + \beta\mathcal{X}(f)$ can be identified with the restriction of $d_\alpha\pi_p(\phi)$ to the circle $\{a\} \times S^1$,
and we have $\|d_\alpha\pi_p(\phi)\| \le \|d_\alpha\phi\|$). Hence, $\phi(a, S^1) \subset \mathbb{C}^n$ is contained in the ball centered
at $z_a := (0, y_a)$ of radius $\rho$. Similarly, there is some $b \in [1 - \delta, 1]$ and $y_b \in \mathbb{C}^q$ such that
$\phi(b, S^1) \subset \mathbb{C}^n$ is contained in the ball centered at $z_b := (0, y_b)$ of the same radius $\rho$.

Let $Z' := [a, b] \times S^1$ and let

$$R := l_{\min}^{-1} \delta^{-1/2} K \left( \|d_\alpha\phi\|_{L^2} + 1 \right).$$

We claim that $\phi(Z')$ is contained in at least one of the balls $B(z_a, R)$ or $B(z_b, R)$. Suppose
this is not the case. Then $\phi(Z')$ is not contained in the union $B(z_a, R/3) \cup B(z_b, R/3)$.
Indeed, if this were true, then, since $\phi(Z')$ is connected, we should have $B(z_a, R/3) \cap
B(z_b, R/3) \ne \emptyset$ (otherwise $\phi(Z')$ would be included either in $B(z_a, R/2)$ or in $B(z_b, R/2)$,
in contradiction with our assumption). But in this case $d(z_a, z_b) < 2R/3$, whence $\phi(Z') \subset
B(z_a, R/3) \cup B(z_b, R/3) \subset B(z_a, R)$, which we assume not to be true. It follows from all
this that there is some $z \in \phi(Z')$ satisfying simultaneously

$$d(z, z_a) \ge R/3 \quad\text{and}\quad d(z, z_b) \ge R/3.$$

In other words, the ball $B(z, R/6)$ is disjoint from both $B(z_a, R/6)$ and $B(z_b, R/6)$.
By our choice of $R$ we have $R \ge 6\rho$, so the discussion above proves that $\phi(a, S^1) \subset
B(z_a, R/6)$ and $\phi(b, S^1) \subset B(z_b, R/6)$. We deduce that $B(z, R/6) \cap \phi(\partial Z') = \emptyset$. Hence
we can apply Lemma 10.17 to $(\phi, \alpha) : Z' \to \mathbb{C}^n$ and deduce that

$$2\pi + \|d_\alpha\phi\|^2_{L^2} \ge \operatorname{area}(Z') + \|d_\alpha\phi\|^2_{L^2(Z')} \ge K'' R^2/36.$$

However, it follows from (10.55) that we must have $K'' R^2/36 > 2\pi + \|d_\alpha\phi\|^2_{L^2}$, which is
a contradiction. This finishes the proof.

11. Long cylinders with small energy and nearly critical residue 

In this section we use the same notations as in the previous one. 

For any natural number $N$, let $C_N = [-N, N] \times S^1$ with the standard product metric
and the induced conformal structure. We denote as always by $(t, \theta)$ the usual coordinates
in $C_N$. Let $\nu = f\,dt \wedge d\theta$ be a volume form, and let $\eta > 0$ be any number. We say that
$\nu$ is exponentially $\eta$-bounded if

$$|f| \le \eta e^{|t| - N}. \tag{11.56}$$

Volume forms satisfying this property arise when we consider $C_N$ as a conformal model
for open sets $U$ in stable curves $(C, x)$ of the form $U \simeq \{xy = \delta\} \subset \mathbb{C}^2$ for small $\delta$, and
we take on Cn the volume form corresponding to the metric f[c, x ]- 

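Theorem 11.1. Fix some number $c \in i\mathbb{R}$. There exist numbers $\epsilon > 0$, $\eta_0 > 0$, $\sigma > 0$ and
$K > 0$ (depending only on $X$, the action of $S^1$ on $X$, and $I$) with the following property.
Let $(\alpha, \phi) : C_N \times S^1 \to X$ be a pair. Suppose that $\bar\partial_{I,\alpha}\phi = 0$ and that there exists some
critical residue $\lambda_{cr} \in \Lambda_{cr}$ such that

$$\|\alpha - \lambda_{cr}\,d\theta\|_{L^\infty} < \epsilon \quad\text{and}\quad \|d_\alpha\phi\|_{L^\infty} < \epsilon. \tag{11.57}$$

Then there exist maps $\psi : [-N, N] \to X^{\lambda_{cr}}$ and $\phi_0 : C_N \to TX$ satisfying $\phi_0(t, \theta) \in T_{\psi(t)}X$,

$$\phi(t, \theta) = e^{-\lambda_{cr}\theta} \exp_{\psi(t)}\left( e^{\lambda_{cr}\theta} \phi_0(t, \theta) \right) \quad\text{and}\quad \int e^{\lambda_{cr}\theta} \phi_0(t, \theta)\,d\theta = 0, \tag{11.58}$$

where $\exp_x : T_xX \to X$ denotes the exponential map on $X$ with respect to the metric
$g$ (see also Remark 11.2 for the precise meaning of the formulae). We distinguish two
cases.

(1) Let $\nu$ be an exponentially $\eta$-bounded volume form on $C_N$, where $\eta < \eta_0$. Suppose
that

$$\iota_\nu d\alpha + \mu(\phi) = c; \tag{11.59}$$

then the following inequality holds:

$$|\phi_0(t, \theta)| < K e^{-\sigma(N - |t|)} \left( \|d_\alpha\phi\|_{L^\infty} + \|\alpha\|_{L^\infty} + \eta(|c| + \sup|\mu|) \right)^{1/4}. \tag{11.60}$$

(2) Suppose now that $d\alpha = 0$; then the previous estimate can be improved to:

$$|\phi_0(t, \theta)| < K e^{-\sigma(N - |t|)} \left( \|d_\alpha\phi\|_{L^\infty} + \|\alpha\|_{L^\infty} \right)^{1/4}. \tag{11.61}$$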

Remark 11.2. Note that the first formula in (11.58) is well defined. To see this, denote
$\lambda_{cr} = ip/q$, where $p$ and $q$ are relatively prime integers. Then the condition $\psi \subset
X^{\lambda_{cr}}$ implies that for any $t$ the point $\psi(t)$ is fixed by the action of $e^{2\pi i/q}$. On the other
hand, $e^{\pm\lambda_{cr}\theta}$ is well defined up to multiplication by powers of $e^{2\pi i/q}$. Finally, since the
exponential map is equivariant we have $e^{-2\pi i/q} \exp_{\psi(t)}(e^{2\pi i/q} v) = \exp_{\psi(t)}(v)$ for any vector
$v \in T_{\psi(t)}X$. As for the second formula in (11.58), it should be understood as
2wq 



/pzivq 
e x " e <f)(t,9)d9 = J e x " e <j)(t,9)d9. 
The proof of Theorem 11.1 will be given in Section 11.1 below. The next theorem states
that the map $\psi$ constructed in Theorem 11.1 is approximately a gradient flow line for
the moment map $\mu$. Before stating it we introduce some notation. Suppose that $(\alpha, \phi)$
is a pair satisfying the hypothesis of Theorem 11.1. Applying a gauge transformation
if necessary, we can assume that $\alpha$ is in temporal gauge, $\alpha = \alpha_\theta\,d\theta$, and that $\alpha_\theta$
restricted to $\{0\} \times S^1$ takes a constant value $\lambda + \lambda_{cr} \in i\mathbb{R}$. Then we can write

$$\alpha = (\lambda + \lambda_{cr} + \beta)\,d\theta \tag{11.62}$$

for some function $\beta : C_N \to i\mathbb{R}$.

Theorem 11.3. Following the notation of Theorem 11.1, we distinguish again two cases.

(1) Suppose that $\iota_\nu d\alpha + \mu(\phi) = c$, where $\nu$ is $\eta$-bounded and $\eta < \eta_0$; then

$$|\psi'(t) + i\lambda I(\psi(t))\mathcal{X}(\psi(t))| \le K e^{-\sigma(N - |t|)} \left( \|d_\alpha\phi\|_{L^\infty} + \|\alpha\|_{L^\infty} + \eta(|c| + \sup|\mu|) \right)^{1/4} \tag{11.63}$$

holds for any $t$, where $K$ is a constant depending only on $X$, the action of $S^1$,
and the almost complex structure $I$;

(2) suppose that $d\alpha = 0$; then, for any $t$, we have

$$|\psi'(t) + i\lambda I(\psi(t))\mathcal{X}(\psi(t))| \le K e^{-\sigma(N - |t|)} \left( \|d_\alpha\phi\|_{L^\infty} + \|\alpha\|_{L^\infty} \right)^{1/4}. \tag{11.64}$$

The proof of Theorem 11.3 will be given in Section 11.2.

11.1. Proof of Theorem 11.1. We first prove the theorem under the assumption that
the critical residue $\lambda_{cr}$ is equal to $0$.

The existence of $\psi$ and $\phi_0$ satisfying (11.58) follows easily from the implicit function
theorem (see for example Section 14 in [FO]) provided $\epsilon$ is small enough (so that $\|\alpha\|_{L^\infty} < \epsilon$
together with $\|d_\alpha\phi\|_{L^\infty} < \epsilon$ imply that for any $t$ the image of $\phi(t, \cdot) : S^1 \to X$ is contained
in a small ball of radius less than the injectivity radius of $X$ and $g$).

To prove that the estimate (11.60) follows from (11.59) we use the same strategy as in
the proof of Theorem 10.4, namely, we make use of an inequality involving the $L^2$ norm
of $\phi_0$ restricted to three consecutive pieces of the cylinder, and then we use the fact that
$(\alpha, \phi)$ is $I$-holomorphic to deduce pointwise bounds on $\phi_0$ from $L^2$ bounds. To state the
inequality we define as in the previous section $C = [-2, 3] \times S^1$. Recall also that when
we write any norm ($L^\infty$, $L^2$, etc.) of either the connection or the section of a pair defined
over $C$, or of a complex structure depending on points of $C$, we mean the norm over $C$
(unless we specify some other domain). Let also $Z_I$, $Z_{II}$ and $Z_{III}$ be the subsets of $C$
defined in (10.35).

Theorem 11.4. There exist some constants $\epsilon > 0$ and $K > 0$, depending only on $X$
and $I$, with the following property. Let $(\alpha, \phi) : C \times S^1 \to X$ be an $I$-holomorphic pair
satisfying $\|\alpha\|_{L^\infty} < \epsilon$ and $\|d_\alpha\phi\|_{L^\infty} < \epsilon$. Let $\psi : [-2, 3] \to X$ and $\phi_0 : C \to TX$ be
defined as in Theorem 11.1.

(1) Suppose that $\|d\alpha\|_{L^2(C)} \le \epsilon \|\phi_0\|_{L^2(C)}$. Then

$$\|\phi_0\|_{L^2(Z_{II})} \le \frac{1}{e + e^{-1}} \left( \|\phi_0\|_{L^2(Z_I)} + \|\phi_0\|_{L^2(Z_{III})} \right). \tag{11.65}$$

(2) Suppose now that $\|d\alpha\|_{L^2} \le K$ and $\|d\alpha\|_{L^\infty} \le K$. Then $\sup_Z |\phi_0| \le K \|\phi_0\|_{L^2}^{1/4}$.

Theorem 11.4 will be proved in Section 11.3, and we now prove Theorem 11.1. Take $\epsilon$ as in Theorem 11.4. Since $d_\alpha\phi = d\phi - i\alpha X(\phi)$, we can bound

$$\|d\phi\|_{L^\infty} \le K(\|d_\alpha\phi\|_{L^\infty} + \|\alpha\|_{L^\infty}). \tag{11.66}$$

As a consequence, and taking (11.57) into account, we deduce that $\|d\phi\|_{L^\infty}$ is uniformly bounded. Let $\nu = f\,dt\wedge d\theta$. Equation (11.59) can be written as $d\alpha = f(c - \mu(\phi))\,dt\wedge d\theta$. Since $\mu(\phi)$ is uniformly bounded (because $X$ is compact), once $c$ has been chosen we can take $\eta_0$ in such a way that the requirement of $\nu$ being exponentially $\eta$-bounded (and hence $\eta_0$-bounded as well) implies that $\|d\alpha\|_{L^2_1(C_N)}$ is necessarily less than the $K$ in Theorem 11.4 (here we use that $\|d\phi\|_{L^\infty}$ is uniformly bounded). On the other hand, we have

$$|d\alpha|(t,\theta) \le \eta(|c| + \sup|\mu|)\,e^{|t|-N}. \tag{11.67}$$

In the sequel we will denote for convenience $R := \eta(|c| + \sup|\mu|)$.

Define, for every $-N \le n < N$, $Z_n := [n,n+1]\times S^1$ and $x_n := \|\phi_0\|^2_{L^2(Z_n)}$. Formula (11.66) implies that for any $n$ we have

$$x_n \le K(\|d_\alpha\phi\|_{L^\infty} + \|\alpha\|_{L^\infty}). \tag{11.68}$$

Let also $z_n := \|d\alpha\|^2_{L^2(Z_n)}$. Formula (11.67) implies that $z_n \le K_0Re^{-(N-|n|)}$ for every $n$, where $K_0$ is a universal constant. On the other hand, Theorem 11.4 implies that if

$$z_{n-2} + \dots + z_{n+2} \le \epsilon(x_{n-2} + \dots + x_{n+2})$$

then $x_n \le (e+e^{-1})^{-1}(x_{n-1} + x_{n+1})$. Hence the sequences $\{x_n\}$ and $\{z_n\}$ satisfy the requirements of Lemma 10.9. Applying Lemma 10.9 together with (11.68) we deduce, for some $\sigma > 0$ independent of $(\alpha,\phi)$, an estimate

$$\|\phi_0\|^2_{L^2(Z_j)} = x_j \le \epsilon^{-1}K\left(R + \|d_\alpha\phi\|_{L^\infty} + \|\alpha\|_{L^\infty}\right)e^{-\sigma(N-|j|)}.$$

To finish the proof of the theorem and obtain (11.60), combine the previous inequality with (2) of Theorem 11.4 (the two conditions required by (2) of Theorem 11.4 are satisfied in our situation: the $L^2_1$ norm of $d\alpha$ on the whole $C_N$ is less than $K$, and $d\alpha$ has bounded $L^\infty$ norm by (11.67)).

The proof that $d\alpha = 0$ implies (11.61) follows exactly the same scheme (and is even easier).
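
The way a three-term inequality of the form $x_n \le (e+e^{-1})^{-1}(x_{n-1}+x_{n+1})$ forces exponential decay away from the ends can be checked numerically. The following is only a sketch of the mechanism behind Lemma 10.9, under the simplifying assumption that the curvature terms $z_n$ vanish; the function names and the test sequence $x_n = \cosh(1.3\,n)$ are hypothetical choices made for illustration and do not come from the text.

```python
import math

# The constant (e + e^{-1})^{-1} appearing in the three-term inequality.
GAMMA = 1.0 / (math.e + 1.0 / math.e)

def satisfies_three_term(x, tol=1e-12):
    """Check x[n] <= GAMMA * (x[n-1] + x[n+1]) at every interior index."""
    return all(x[n] <= GAMMA * (x[n - 1] + x[n + 1]) * (1 + tol)
               for n in range(1, len(x) - 1))

def decay_envelope(x):
    """Sequence y[k] = x[0] e^{-k} + x[-1] e^{-(L-k)}, with L = len(x) - 1.

    y satisfies the three-term relation with equality and dominates x at
    both ends, so a discrete maximum principle gives x <= y everywhere.
    """
    last = len(x) - 1
    return [x[0] * math.exp(-k) + x[-1] * math.exp(-(last - k))
            for k in range(len(x))]

N = 10
# Hypothetical test data: cosh(a n) satisfies the inequality whenever a >= 1.
x = [math.cosh(1.3 * n) for n in range(-N, N + 1)]
y = decay_envelope(x)
```

In this simplified setting the envelope `y` plays the role of the exponentially decaying bound; the actual Lemma 10.9 also has to absorb the terms $z_n$, which this sketch ignores.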

Now we consider the case of a general critical residue $\lambda_{cr} \in \Lambda_{cr}$. Let us write $\lambda_{cr} = ip/q$ for some relatively prime integers $p, q$ satisfying $q \ge 1$. Consider the covering map

$$\pi : C_{N/q} \to C_N$$

defined as $\pi(t,\theta) := (qt, q\theta)$, and define a new pair $(\alpha',\phi') := \pi^*(\alpha,\phi)$. The first inequality in (11.57) is equivalent to $\|\alpha' - p\,d\theta\|_{L^\infty} < q\epsilon$. Applying the gauge transformation $g(t,\theta) := e^{ip\theta}$ we obtain another pair $(\alpha'',\phi'') := g^*(\alpha',\phi') = (\alpha' - p\,d\theta,\ e^{i\theta p}\phi')$. The new pair now satisfies $\|\alpha''\|_{L^\infty} < q\epsilon$, $\|d_{\alpha''}\phi''\|_{L^\infty} < q\epsilon$ and $\bar\partial_{I,\alpha''}\phi'' = 0$. Also, equation (11.59) is satisfied with $\nu$ replaced by $\pi^*\nu$ (which now is exponentially $q\eta$-bounded). Hence, provided $\epsilon$ and $\eta$ are small enough, we can apply the case of zero residue to the pair $(\alpha'',\phi'')$. We thus arrive at a pair of maps $\psi'' : [-N/q, N/q] \to X$ and $\phi''_0 : C_{N/q} \to TX$ satisfying $\phi'' = \exp_{\psi''}\phi''_0$ and $\int\phi''_0(t,\nu)\,d\nu = 0$. We now prove that $\psi'' \subset X^{\lambda_{cr}}$. Let $a := 2\pi/q$. For any $t, \theta$ we compute

$$\phi''(t, a+\theta) = e^{i(a+\theta)p}\phi'(t, a+\theta) = e^{iap}e^{i\theta p}\phi'(t,\theta) = e^{iap}\phi''(t,\theta),$$

which, combined with the fact that $g$ is $S^1$-invariant, implies $\psi'' \subset X^{\lambda_{cr}}$. The computation above implies similarly that $\phi''_0(t, a+\theta) = e^{iap}\phi''_0(t,\theta)$, so that the map

$$\phi'_0(t,\theta) := e^{-ip\theta}\phi''_0(t,\theta)$$

satisfies $\phi'_0(t,\theta) = \phi'_0(t, a+\theta)$ and hence descends to give a map $\phi_0 : C_N \to TX$ such that

$$\phi'_0(t,\theta) = \phi_0(qt, q\theta).$$

Define also $\psi : [-N,N] \to X^{\lambda_{cr}}$ by the condition $\psi''(t) = \psi(qt)$. Then the equality $\phi'' = \exp_{\psi''}\phi''_0$ translates into

$$\phi(qt, q\theta) = e^{-i\theta p}\exp_{\psi(qt)}\left(e^{i\theta p}\phi_0(qt, q\theta)\right),$$

which is equivalent to the first formula in (11.58). On the other hand, the balancing condition $\int\phi''_0(t,\nu)\,d\nu = 0$ is clearly equivalent to $\int e^{\lambda_{cr}\nu}\phi_0(t,\nu)\,d\nu = 0$.

Finally, since the set of $q$ which appear as denominators of numbers in $i\Lambda_{cr}$ is finite, $\epsilon$ and $\eta_0$ can be chosen in such a way that for any critical residue we can reduce to the case of zero residue by applying the procedure which we just described.
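
The equivariance computation used in the covering argument can be verified on a toy example. The sketch below assumes, purely for illustration, that $X = \mathbb{C}$ with $S^1$ acting by scalar multiplication, and uses a hypothetical $2\pi$-periodic test map `phi`; none of these choices come from the text.

```python
import cmath
import math

def phi(t, theta):
    """Hypothetical sample map, 2*pi-periodic in theta, with values in C."""
    return (1 + 0.1 * t) * cmath.exp(3j * theta) + 0.5 * cmath.exp(-1j * theta)

# Residue lambda_cr = i p / q with p, q relatively prime (sample values).
p, q = 2, 5

def phi_prime(t, theta):
    """Pullback of phi under the covering pi(t, theta) = (q t, q theta)."""
    return phi(q * t, q * theta)

def phi_dprime(t, theta):
    """Gauge transformed section e^{i p theta} * phi'."""
    return cmath.exp(1j * p * theta) * phi_prime(t, theta)

# Deck transformation angle a = 2 pi / q of the covering.
a = 2 * math.pi / q
```

Evaluating `phi_dprime(t, theta + a)` and `cmath.exp(1j * a * p) * phi_dprime(t, theta)` at any sample point gives the same value up to rounding, which is the identity $\phi''(t,a+\theta) = e^{iap}\phi''(t,\theta)$ in this toy setting.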

11.2. Proof of Theorem 11.3. As in the proof of Theorem 11.1, we prove the case $\lambda_{cr} = 0$. Using the same covering argument as in Section 11.1 the general case is reduced to this case. Furthermore, we only prove that $\iota_\nu d\alpha + \mu(\phi) = c$ implies (11.63). That $d\alpha = 0$ implies (11.64) is even simpler and follows from the same ideas.

We begin by stating a local version of the theorem. 

Lemma 11.5. Let $K_0 > 0$ be a real number, let $I$ (resp. $g$, $X$) be an almost complex structure (resp. Riemannian metric, vector field) on $\mathbb{C}^n$, and let $V \subset \mathbb{C}^n$ be a compact subset. Let $Z := [-1,2]\times S^1$, and let $\phi : Z \to V$ be a map satisfying

$$\frac{\partial\phi}{\partial t} = I(\phi)\left(\frac{\partial\phi}{\partial\theta} - i(\lambda+\beta)X(\phi)\right). \tag{11.69}$$

Suppose that the diameter of $\phi(Z)$ is small enough so that $\psi : [0,1] \to \mathbb{C}^n$ and $\phi_0 : Z \to \mathbb{C}^n$ can be defined as in Theorem 11.1 (here we identify $T_{\psi(t)}\mathbb{C}^n \simeq \mathbb{C}^n$) and that

$$|\phi_0| \le K_0 \quad\text{and}\quad \left|\frac{\partial\phi_0}{\partial t}\right| \le K_0. \tag{11.70}$$

Then for any $t \in (0,1)$ we have

$$\left|\psi'(t) + i\lambda I(\psi(t))X(\psi(t))\right| \le K\sup_{Z_t}(|\phi_0| + |\beta|) \tag{11.71}$$

for some constant $K$ independent of $\phi$, where $Z_t := \{t\}\times S^1$.

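Proof. Let $V' := \{\exp^g_x v \mid x \in V,\ |v| \le K_0\}$. By hypothesis we have $\psi([0,1]) \subset V'$. Let $B \subset \mathbb{C}^n$ be the closed ball of radius $K_0$, and let $E : V'\times B \to \mathbb{C}^n$ be the exponential map $E(x,v) := \exp^g_x v$. Consider the first derivatives $F := D_xE$ and $G := D_vE$. Since the domain of $E$ is compact, the second derivatives of $E$ are uniformly bounded; combining this observation with the fact that $F(x,0) = \mathrm{Id}$ and $G(x,0) = \mathrm{Id}$ for any $x$, we obtain bounds

$$|F(x,v) - \mathrm{Id}| \le K|v| \quad\text{and}\quad |G(x,v) - \mathrm{Id}| \le K|v|. \tag{11.72}$$

Taking partial derivatives with respect to $t$ in the equality $\exp_\psi\phi_0 = \phi$ we obtain

$$\frac{\partial\phi}{\partial t} = F(\psi,\phi_0)\,\psi' + G(\psi,\phi_0)\,\frac{\partial\phi_0}{\partial t}.$$

Writing $F = \mathrm{Id} + (F - \mathrm{Id})$ and $G = \mathrm{Id} + (G - \mathrm{Id})$ and integrating over $S^1$, the above equality combined with (11.70), (11.72), and the fact that $\int\phi_0(t,\theta)\,d\theta = 0$ for every $t$, yields

$$\left|\psi'(t) - \frac{1}{2\pi}\int\frac{\partial\phi}{\partial t}(t,\theta)\,d\theta\right| \le K\sup_{Z_t}|\phi_0|. \tag{11.73}$$

Using (11.69) we compute

$$\frac{\partial\phi}{\partial t} = -i\lambda I(\psi)X(\psi) + I(\psi)\frac{\partial\phi}{\partial\theta} + (I(\phi) - I(\psi))\left(\frac{\partial\phi}{\partial\theta} - i\lambda X(\psi)\right) - i\lambda I(\phi)(X(\phi) - X(\psi)) - i\beta I(\phi)X(\phi).$$

Integrating over $S^1$ and dividing by $2\pi$, the first term in the right hand side does not change; the second term vanishes, because $\psi$ is independent of $\theta$ and the integral of $\partial\phi/\partial\theta$ over $S^1$ is obviously $0$; the third and fourth terms can be bounded by $K\sup_{Z_t}|\phi_0|$, since both $|I(\psi) - I(\phi)|$ and $|X(\psi) - X(\phi)|$ are less than $K\sup_{Z_t}|\phi_0|$ (of course here $K$ depends on the derivatives of $I$ and $X$) and (11.70) holds; the fifth term can be bounded by $K\sup_{Z_t}|\beta|$. Putting together all these observations and using (11.73) we obtain inequality (11.71). □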

Covering $X$ with a finite number of charts we deduce from the lemma that for any $t \in [-N,N]$:

$$\left|\psi'(t) + i\lambda I(\psi(t))X(\psi(t))\right| \le K\sup_{\{t\}\times S^1}(|\phi_0| + |\beta|). \tag{11.74}$$

Note in fact that the constant $K_0$ in the Lemma can be taken independently of $(\alpha,\phi)$, and depending only on the $\epsilon$ in Theorem 11.1, because (11.57) implies the bounds (11.70). Combining this with the fact that the number of charts which we use is finite, we deduce that the constant $K$ in the above inequality is independent also of $(\alpha,\phi)$.

To finish the proof of Theorem 11.3 we need to bound the right hand side of (11.74). On the one hand, formula (11.60) in Theorem 11.1 tells us that

$$|\phi_0(t,\theta)| \le Ke^{\sigma(|t|-N)}\left(\|d_\alpha\phi\|_{L^\infty} + \|\alpha\|_{L^\infty} + \eta(|c| + \sup|\mu|)\right)^{1/4}. \tag{11.75}$$

On the other hand, we can bound $\beta$ writing

$$\beta(t,\theta) = \beta(0,\theta) + \int_0^t\frac{\partial\beta}{\partial\tau}(\tau,\theta)\,d\tau = \int_0^t\frac{\partial\beta}{\partial\tau}(\tau,\theta)\,d\tau,$$

then using $d\alpha = \frac{\partial\beta}{\partial t}\,dt\wedge d\theta$ and (11.67) to obtain

$$|\beta(t,\theta)| \le \int_0^t|d\alpha(\tau,\theta)|\,d\tau \le \int_0^t e^{|\tau|-N}\eta(|c| + \sup|\mu|)\,d\tau \le e^{|t|-N}\eta(|c| + \sup|\mu|) \le e^{\sigma(|t|-N)}\eta(|c| + \sup|\mu|) \le Ke^{\sigma(|t|-N)}\left(\eta(|c| + \sup|\mu|)\right)^{1/4}, \tag{11.76}$$

where we have used that $\sigma < 1$ and the fact that $\eta(|c| + \sup|\mu|)$ is bounded above. Combining the estimates (11.75) and (11.76) with (11.74) we obtain the desired bound.
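
The chain of elementary inequalities in (11.76) can be sanity-checked numerically. The following sketch evaluates each successive upper bound for $0 \le t \le N$; the function name, the abbreviation `R` for $\eta(|c|+\sup|\mu|)$, the assumed upper bound `R_max` on it, and the explicit choice $K = R_{max}^{3/4}$ are illustrative assumptions rather than quantities fixed in the text.

```python
import math

def beta_bound_chain(t, N, sigma, R, R_max):
    """Successive upper bounds from (11.76), for 0 <= t <= N.

    R stands for eta * (|c| + sup|mu|) and R_max is an assumed upper
    bound for it; K = R_max ** 0.75 is one admissible constant.
    """
    if not (0 <= t <= N and 0 < sigma < 1 and 0 < R <= R_max):
        raise ValueError("parameters outside the range covered by (11.76)")
    # Exact value of the integral of R e^{tau - N} over [0, t].
    integral = R * (math.exp(t - N) - math.exp(-N))
    step1 = R * math.exp(t - N)
    # Uses sigma < 1 together with t - N <= 0.
    step2 = R * math.exp(sigma * (t - N))
    # Uses R <= R_max, hence R <= R_max^{3/4} R^{1/4}.
    step3 = R_max ** 0.75 * math.exp(sigma * (t - N)) * R ** 0.25
    return [integral, step1, step2, step3]
```

For any admissible parameters the returned list is nondecreasing, which is exactly the content of the chain; the last step is where boundedness of $\eta(|c|+\sup|\mu|)$ enters.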

11.3. Proof of Theorem 11.4. Using Lemma 10.4 we reduce the proof of Theorem 11.4 to a local version of it, very much as we did in the proof of Theorem 10.5. We use the same notation as in Lemma 10.6, so we fix a diagonal action of $S^1$ on $\mathbb{C}^n$ with weights $w = (w_1,\dots,w_n) \in \mathbb{Z}^n$. Let $I_0$ (resp. $g_0$) denote the standard complex structure (resp. Riemannian metric) on $\mathbb{C}^n$.

Lemma 11.6. For any $r > 0$ and $K > 0$ there exists some $\epsilon = \epsilon(K,w,r) > 0$ with the following property. Suppose that $I$ (resp. $g$) is a smooth equivariant almost complex structure (resp. Riemannian metric) on $\mathbb{C}^n$ such that $\|I - I_0\|_{L^\infty} < \epsilon$, $\|DI\|_{L^\infty} < K$, and $\|D^2I\|_{L^\infty} < K$ (here $DI$ and $D^2I$ denote the first and second derivatives of $I$ and the norms are taken with respect to the standard metric $g_0$ on $\mathbb{C}^n$), and $\|g - g_0\|_{L^\infty} < \epsilon$ and $\|Dg\|_{L^\infty} < K$. Let $(\alpha,\phi) : C\times S^1 \to \mathbb{C}^n$ be an $I$-holomorphic pair satisfying $\|\alpha\|_{L^\infty} < \epsilon$, $\|d_\alpha\phi\|_{L^\infty} < \epsilon$ and

$$\phi(C) \subset B(0,2r) \subset \mathbb{C}^n.$$

Let $\psi : [-2,3] \to \mathbb{C}^n$ and $\phi_0 : C \to T\mathbb{C}^n$ be defined as in Theorem 11.1.

(1) Suppose that $\|d\alpha\|_{L^2} < \epsilon\|\phi_0\|_{L^2}$. Then

$$\|\phi_0\|^2_{L^2(Z_{II})} \le \frac{1}{e+e^{-1}}\left(\|\phi_0\|^2_{L^2(Z_I)} + \|\phi_0\|^2_{L^2(Z_{III})}\right). \tag{11.77}$$

(2) Suppose now that $\|d\alpha\|_{L^2_1} < K$ and $\|d\alpha\|_{L^\infty} < K$. Then $\sup_Z|\phi_0| \le K\|\phi_0\|^{1/4}_{L^2}$.

The proof of the lemma will be given in Section 11.5 below. 

Remark 11.7. We specify the dependence of $\epsilon$ on $DI$, $D^2I$ and $Dg$ because we want to be sure that the hypotheses of the lemma are preserved when we zoom in (this will become clear in the course of the proof). The same comment applies to Lemma 11.8.

11.4. Comparison between $\phi_0$ and $\phi_{av}$. When $(\alpha,\phi)$ is a pair taking values in $\mathbb{C}^n$, the map $\phi_0$ defined in Theorem 11.4 can be, roughly speaking, approximated by the map $\phi_{av}$ defined in (2) of Lemma 10.10. The following lemma makes this statement precise, specifying to what extent $\phi_{av}$ is a good approximation of $\phi_0$, both in $L^2$ and $C^0$ norms. This will be crucial in proving Lemma 11.6, since it is clearly easier to deal with $\phi_{av}$ than with $\phi_0$.

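Lemma 11.8. For any $K > 0$ there exist constants $K_0 > 0$ and $\epsilon > 0$ with the following property. Denote by $g_0$ the standard Riemannian flat metric in $\mathbb{C}^n$. Let $g$ be another metric on $\mathbb{C}^n$ such that $K^{-1}g_0 \le g \le Kg_0$ and such that $\|Dg\|_{L^\infty} \le K$, where $Dg$ denotes the first derivatives of $g$ and the norm is taken with respect to $g_0$. Let $x \in \mathbb{C}^n$ and let $\gamma_0 : S^1 \to \mathbb{C}^n$ be a smooth map satisfying $\int\gamma_0 = 0$ and $\sup|\gamma_0| < \epsilon$. Let $\gamma(\theta) := \exp^g_x\gamma_0(\theta)$ (here we are identifying $T_x\mathbb{C}^n \simeq \mathbb{C}^n$) and define

$$\gamma_{av}(\theta) := \gamma(\theta) - \frac{1}{2\pi}\int\gamma(\nu)\,d\nu.$$

Then we have

$$\|\gamma_0\|_{L^2}(1 - K_0\|\gamma_0\|_{L^2}) \le \|\gamma_{av}\|_{L^2} \le \|\gamma_0\|_{L^2}(1 + K_0\|\gamma_0\|_{L^2}) \tag{11.78}$$

and similarly

$$\sup|\gamma_0|\,(1 - K_0\sup|\gamma_0|) \le \sup|\gamma_{av}| \le \sup|\gamma_0|\,(1 + K_0\sup|\gamma_0|). \tag{11.79}$$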

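Proof. We claim that given $K > 0$, $\epsilon > 0$, and a metric $g$ on $\mathbb{C}^n$ satisfying $K^{-1}g_0 \le g \le Kg_0$ and such that $\|Dg\|_{L^\infty} \le K$, there is a constant $K'$ depending only on $K$ such that for any $x, v \in \mathbb{C}^n$ satisfying $|v| < \epsilon$ we have

$$\left|\exp^g_x v - x - v\right| \le K'|v|^2. \tag{11.80}$$

To see this, define $\gamma(t) := \exp^g_x tv$. Since $\gamma(t)$ is a geodesic, we have $\gamma''_k = -\sum_{i,j}\Gamma^k_{ij}\gamma'_i\gamma'_j$ (here $\gamma_1,\dots,\gamma_{2n}$ are the components of $\gamma$). Integrating and using $\gamma'(0) = v$ we deduce that (provided $\epsilon > 0$ has been chosen small enough and $|v| < \epsilon$) for any $t \in [0,1]$, $|\gamma'(t) - v| \le K'|v|^2$, where $K'$ is proportional to the sup norm of $\Gamma^k_{ij}$ which, in turn, can be estimated in terms of $(g^{ij}) = (g_{ij})^{-1}$ and the derivatives of $(g_{ij})$. Integrating this inequality we obtain (11.80).

It follows from (11.80) that

$$\left|x - \frac{1}{2\pi}\int\exp^g_x\gamma_0\right| = \left|x + \frac{1}{2\pi}\int\gamma_0 - \frac{1}{2\pi}\int\exp^g_x\gamma_0\right| \le K'\|\gamma_0\|^2_{L^2}.$$

Consequently we have

$$|\gamma_{av}(\theta) - \gamma_0(\theta)| = \left|\gamma(\theta) - \frac{1}{2\pi}\int\gamma - \gamma_0(\theta)\right| \le \left|\gamma(\theta) - x - \gamma_0(\theta)\right| + \left|x - \frac{1}{2\pi}\int\gamma\right| \le K'\left(\|\gamma_0\|^2_{L^2} + |\gamma_0(\theta)|^2\right).$$

Integrating over $\theta$ we obtain (11.78). Inequality (11.79) is proved similarly. □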
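
The pointwise comparison between $\gamma_{av}$ and $\gamma_0$ obtained in the proof can be illustrated on a toy model. The sketch below does not compute geodesics: it assumes, as a stand-in for $\exp^g_x$, the map $v \mapsto x + v + c\,v^2$ on $\mathbb{C}$, whose deviation from $x + v$ is at most $|c||v|^2$, so that $|c|$ plays the role of $K'$ in (11.80). The loop `g0`, the grid size and all names are hypothetical.

```python
import cmath
import math

def toy_exp(x, v, c):
    """Toy 'exponential map' x + v + c v^2 (assumed quadratic error term)."""
    return x + v + c * v * v

def compare_average(x, c, eps, m=64):
    """Largest ratio |gamma_av - gamma_0| / (mean|gamma_0|^2 + |gamma_0|^2).

    Averages over the circle are replaced by means over m grid points.
    """
    thetas = [2 * math.pi * k / m for k in range(m)]
    # gamma_0 has zero mean on the grid (frequencies 1 and -2 only).
    g0 = [eps * (cmath.exp(1j * th) + 0.3 * cmath.exp(-2j * th))
          for th in thetas]
    g = [toy_exp(x, v, c) for v in g0]
    mean_g = sum(g) / m
    g_av = [z - mean_g for z in g]
    mean_sq = sum(abs(v) ** 2 for v in g0) / m
    return max(abs(a - v) / (mean_sq + abs(v) ** 2)
               for a, v in zip(g_av, g0))
```

In this model the returned ratio never exceeds $|c|$, mirroring the bound $|\gamma_{av}(\theta) - \gamma_0(\theta)| \le K'(\|\gamma_0\|^2_{L^2} + |\gamma_0(\theta)|^2)$ (the grid mean of $|\gamma_0|^2$ is smaller than the $L^2$ norm squared by the factor $2\pi$).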

11.5. Proof of Lemma 11.6. We follow an idea similar to the proof of Lemma 10.6 in Section 10.8. Assume that there exist sequences of positive real numbers $\epsilon_u \to 0$, invariant almost complex structures $I_u$ and metrics $g_u$, and $I_u$-holomorphic pairs $(\alpha_u,\phi_u)$. Suppose that $I_u$, $g_u$ and $(\alpha_u,\phi_u)$ satisfy the hypotheses of the lemma for $\epsilon = \epsilon_u$. In particular, we have

$$\|\alpha_u\|_{L^\infty} \to 0 \quad\text{and}\quad \|d_{\alpha_u}\phi_u\|_{L^\infty} \to 0. \tag{11.81}$$

Finally, suppose that for each $u$ either (1) or (2) of the lemma fails to be true. We will see that this is impossible.

As we did in Lemma 10.6, we assume that $w_j$ is nonzero for any $j$ between $1$ and $p$, and that $w_{p+1} = \dots = w_n = 0$. Let $q = n - p$, so that we have a splitting $\mathbb{C}^n = \mathbb{C}^p\times\mathbb{C}^q$. Let also $\pi_p$ and $\pi_q$ denote the projections from $\mathbb{C}^n$ to $\mathbb{C}^p$ and $\mathbb{C}^q$.

Define $\rho_u := \|d_{\alpha_u}\phi_u\|_{L^\infty}$ and $s_u := \sup|\pi_p\phi_u(C)|$. Note that since $\phi_u(C) \subset B(0,2r)$ the numbers $s_u$ are uniformly bounded above. Let $y_u \in C$ be a point where $|\pi_p\phi_u|$ attains the value $s_u$, and let $x_u = \phi_u(y_u)$. Passing to a subsequence if necessary, we can assume that the sequence $\rho_u^{-1}s_u$ converges somewhere in $\mathbb{R}_{\ge 0}\cup\{\infty\}$. We distinguish two possibilities.

Suppose first that $\rho_u^{-1}s_u \to e < \infty$. Let $\phi'_u := \rho_u^{-1}(\phi_u - x_u)$. Define also $I'_u$ (resp. $g'_u$) to be the pullback of $I_u$ (resp. $g_u$) under the composition of the translation along $x_u$ with the homothety of ratio $\rho_u$. Then $I'_u$ and $g'_u$ are $S^1$-invariant and still satisfy the bounds in the hypothesis of the lemma. Furthermore, $(\alpha_u,\phi'_u)$ is an $I'_u$-holomorphic pair. Finally, since $\|d_{\alpha_u}\phi'_u\|_{L^\infty} = 1$, $\|\alpha_u\|_{L^\infty} \to 0$ and $|\phi'_u(y_u)| \le e$, the image $\phi'_u(C)$ is contained in a compact set independent of $u$. Hence we may apply Lemma 10.13 and deduce that, up to regauging, the pairs $(\alpha_u,\phi'_u)$ converge in $C^1$ norm to an $I_0$-holomorphic pair $(0,\phi)$. So $\|d\phi\|_{L^\infty} = 1$ (hence $\phi$ is not constant) and $\bar\partial_{I_0}\phi = 0$.

Let $\phi_{av}$ be as defined in (2) of Lemma 10.10. Since $\phi$ is holomorphic and not constant, by Lemma 10.10 we have

$$\|\phi_{av}\|^2_{L^2(Z_{II})} \le \frac{1}{e^2+e^{-2}}\left(\|\phi_{av}\|^2_{L^2(Z_I)} + \|\phi_{av}\|^2_{L^2(Z_{III})}\right).$$

On the other hand, since the convergence $\phi'_u \to \phi$ is in $C^0$, it follows that $\phi'_{u,av}$ converges pointwise to $\phi_{av}$. This implies that for big enough $u$ we have

$$\|\phi'_{u,av}\|^2_{L^2(Z_{II})} \le \frac{1}{e^{1.5}+e^{-1.5}}\left(\|\phi'_{u,av}\|^2_{L^2(Z_I)} + \|\phi'_{u,av}\|^2_{L^2(Z_{III})}\right).$$

Since $\phi'_u$ is related to $\phi_u$ by a translation and a homothety, it follows that the same inequality is satisfied by $\phi_{u,av}$. Finally, (11.78) in Lemma 11.8 implies (using the fact that $\|\phi_{u,0}\|_{L^2}$ converges to $0$) that for big enough $u$

$$\|\phi_{u,0}\|^2_{L^2(Z_{II})} \le \frac{1}{e+e^{-1}}\left(\|\phi_{u,0}\|^2_{L^2(Z_I)} + \|\phi_{u,0}\|^2_{L^2(Z_{III})}\right).$$

Hence (1) in Lemma 11.6 has to hold for big enough $u$.
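
The constant $e^2+e^{-2}$ in the inequality for the holomorphic limit can be recovered from a single Fourier mode. The sketch below assumes that $Z_I$, $Z_{II}$, $Z_{III}$ are the consecutive unit pieces $[-1,0]\times S^1$, $[0,1]\times S^1$, $[1,2]\times S^1$ (the definition (10.35) is not reproduced here) and considers a mode of modulus $e^{kt}$ with $k \ne 0$; the function names are hypothetical.

```python
import math

def mode_mass(k, a, b):
    """Integral of e^{2kt} over [a, b] for k != 0.

    Up to the factor 2*pi this is the squared L^2 norm, over [a, b] x S^1,
    of a Fourier mode whose modulus is e^{kt}.
    """
    return (math.exp(2 * k * b) - math.exp(2 * k * a)) / (2 * k)

def three_piece_ratio(k):
    """Squared norm on the middle piece divided by the sum on its neighbours."""
    left = mode_mass(k, -1, 0)
    mid = mode_mass(k, 0, 1)
    right = mode_mass(k, 1, 2)
    return mid / (left + right)
```

For every nonzero integer $k$ the ratio equals $1/(e^{2k}+e^{-2k})$, which is at most $1/(e^2+e^{-2})$ with equality for $k = \pm 1$. Since distinct modes are $L^2$-orthogonal on each piece, the squared norms add and the same bound holds for any zero-average sum of such modes.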

On the other hand, since $\phi_{av}$ is holomorphic and satisfies $\int\phi_{av}(t,\nu)\,d\nu = 0$, standard elliptic estimates imply that, for some $K$ independent of $\phi$, we have

$$\sup_Z|\phi_{av}| \le K\|\phi_{av}\|_{L^2}.$$

Again using the fact that $\phi'_{u,av}$ converges to $\phi_{av}$ we deduce that the same inequality holds for $\phi_{u,av}$ provided $u$ is big enough and maybe after increasing $K$ slightly. Finally, (11.79) in Lemma 11.8 implies a similar inequality $\sup_Z|\phi_{u,0}| \le K\|\phi_{u,0}\|_{L^2}$. But since $\|\phi_{u,0}\|_{L^2}$ is smaller than $1$ for big enough $u$, this implies

$$\sup_Z|\phi_{u,0}| \le K\|\phi_{u,0}\|^{1/4}_{L^2}.$$

Consequently, (2) in Lemma 11.6 has to hold for big enough $u$. And this is in contradiction with our assumptions.

Now suppose that $\rho_u^{-1}s_u \to \infty$ and define $\phi'_u := s_u^{-1}(\phi_u - x_u)$. Then $\|d_{\alpha_u}\phi'_u\|_{L^\infty} \to 0$, so the diameter of $\phi'_u(C)$ converges to $0$. On the other hand each $\phi'_u(C)$ intersects the set

$$S = \{(x,0) \in \mathbb{C}^p\times\mathbb{C}^q \mid |x| = 1\} \subset \mathbb{C}^n.$$

Then (1) in Lemma 11.9 below implies that for big enough $u$ the pairs have to satisfy (1) in Lemma 11.6. On the other hand, (2) in Lemma 11.9 implies that for big enough $u$ we have $\sup_Z|\phi'_{u,0}| \le K\|\phi'_{u,0}\|^{1/4}_{L^2}$. Since $\phi_{u,0} = s_u\phi'_{u,0}$ and $s_u$ is uniformly bounded above, maybe after increasing $K$ we also have the following inequality for big enough $u$:

$$\sup_Z|\phi_{u,0}| \le K\|\phi_{u,0}\|^{1/4}_{L^2}.$$

This implies that (2) in Lemma 11.6 has to hold for big enough $u$, leading to a contradiction and finishing the proof of the lemma.

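Lemma 11.9. For any $K > 0$ there exist numbers $\epsilon = \epsilon(K,w) > 0$ and $\delta = \delta(K,w) > 0$ with the following property. Suppose that $I$ (resp. $g$) is a smooth equivariant almost complex structure (resp. Riemannian metric) on $\mathbb{C}^n$ such that $\|I - I_0\|_{L^\infty} < \epsilon$, $\|DI\|_{L^\infty} < K$, and $\|D^2I\|_{L^\infty} < K$ and $\|g - g_0\|_{L^\infty} < \epsilon$ and $\|Dg\|_{L^\infty} < K$. Let $(\alpha,\phi) : C\times S^1 \to \mathbb{C}^n$ be an $I$-holomorphic pair satisfying $\|\alpha\|_{L^\infty} < \epsilon$, $\|d_\alpha\phi\|_{L^\infty} < \epsilon$, and

$$\phi(C) \subset B(y,\delta) \subset \mathbb{C}^n,$$

where $y$ is a point in $S$. Let $\psi : [-2,3] \to \mathbb{C}^n$ and $\phi_0 : C \to T\mathbb{C}^n$ be defined as in Theorem 11.1.

(1) Suppose that $\|d\alpha\|_{L^2} < \epsilon\|\phi_0\|_{L^2}$. Then

$$\|\phi_0\|^2_{L^2(Z_{II})} \le \frac{1}{e+e^{-1}}\left(\|\phi_0\|^2_{L^2(Z_I)} + \|\phi_0\|^2_{L^2(Z_{III})}\right).$$

(2) There is a constant $K'$ depending only on $K$ with the following property. Suppose that $\|d\alpha\|_{L^2_1} < K$ and $\|d\alpha\|_{L^\infty} < K$. Then $\sup_Z|\phi_0| \le K'\|\phi_0\|^{1/4}_{L^2}$.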

11.6. Proof of Lemma 11.9. We deduce the lemma from a local statement similar to Lemma 11.6 but accounting for the degenerate situation in which instead of a linear action of $S^1$ on a vector space we have an action of $\mathbb{R}$ by translations. So now $X$ denotes a constant vector in $\mathbb{C}^n$.

If $(\alpha,\phi) : C \to \mathbb{C}^n$ is a pair in which $\alpha = a_\theta\,d\theta$, its energy density is defined by

$$|d_\alpha\phi|^2 = \left|\frac{\partial\phi}{\partial t}\right|^2 + \left|\frac{\partial\phi}{\partial\theta} - ia_\theta X\right|^2.$$

Furthermore, the condition on $(\alpha,\phi)$ of being $I$-holomorphic reads

$$\frac{\partial\phi}{\partial t} = I(\phi)\left(\frac{\partial\phi}{\partial\theta} - ia_\theta X\right). \tag{11.82}$$

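Lemma 11.10. For any $K > 0$ there exists a constant $\epsilon > 0$ with the following property. Let $I$ be a translation invariant almost complex structure on $\mathbb{C}^n$ satisfying

$$\|I - I_0\|_{L^\infty} < \epsilon, \quad \|DI\|_{L^\infty} < K, \quad\text{and}\quad \|D^2I\|_{L^\infty} < K$$

and let $g$ be a Riemannian metric on $\mathbb{C}^n$ such that $\|g - g_0\|_{L^\infty} < \epsilon$ and $\|Dg\|_{L^\infty} < K$. Take an $I$-holomorphic pair $(\alpha,\phi) : C \to \mathbb{C}^n$ satisfying

$$\|\alpha\|_{L^\infty} < \epsilon \quad\text{and}\quad \|d_\alpha\phi\|_{L^\infty} < \epsilon, \tag{11.83}$$

and define $\psi : [-2,3] \to \mathbb{C}^n$ and $\phi_0 : C \to T\mathbb{C}^n$ as in Theorem 11.1.

(1) Suppose that $\|d\alpha\|_{L^2} < \epsilon\|\phi_0\|_{L^2}$. Then

$$\|\phi_0\|^2_{L^2(Z_{II})} \le \frac{1}{e+e^{-1}}\left(\|\phi_0\|^2_{L^2(Z_I)} + \|\phi_0\|^2_{L^2(Z_{III})}\right). \tag{11.84}$$

(2) There is a constant $K'$ depending only on $K$ with the following property. Suppose that $\|d\alpha\|_{L^2_1} < K$ and $\|d\alpha\|_{L^\infty} < K$. Then we have

$$\sup_Z|\phi_0| \le K'\|\phi_0\|^{1/4}_{L^2}. \tag{11.85}$$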

We now resume the proof of Lemma 11.9. To avoid confusion, let $E$ denote $\mathbb{C}^n$ with the linear action of $S^1$ and let $F$ denote $\mathbb{C}^n$ with an action of $\mathbb{R}$ given by translations along a vector $X \in F$. We have the following analogue of the charts constructed in Lemma 10.4. Suppose that $I$ (resp. $g$) is any $S^1$-invariant complex structure (resp. Riemannian metric) on $E$. Let $\epsilon > 0$ be as in Lemma 11.10. For any point $x \in S \subset E$ there exists a neighbourhood $U_x$ of $x$, a translation invariant complex structure $I_x$ and Riemannian metric $g_x$ on $F$ satisfying $\|I_x - I_0\|_{L^\infty} < \epsilon$ and $\|g_x - g_0\|_{L^\infty} < \epsilon$ (where $I_0$ and $g_0$ are the standard structures on $F$) and such that $\|DI_x\|_{L^\infty}$, $\|D^2I_x\|_{L^\infty}$ and $\|Dg_x\|_{L^\infty}$ are bounded, and a complex isometry $\xi_x : U_x \to F$. Then $\{U_x\}_{x\in S}$ cover $S$. Consider a finite subcovering $\{U_1,\dots,U_l\}$. For each open set $U_j$, let

$$K_j := \max\{\|DI_j\|_{L^\infty}, \|D^2I_j\|_{L^\infty}, \|Dg_j\|_{L^\infty}\}.$$

Let $\epsilon_j$ be the value of $\epsilon$ given by Lemma 11.10 for the choice $K = K_j$, and set $\epsilon := \min\{\epsilon_1,\dots,\epsilon_l\}$. There exists some constant $\delta > 0$ such that any ball of radius $\delta$ centered somewhere in $S$ is contained in one of the $U_j$'s. Taking this value of $\delta$, the statement of Lemma 11.9 follows from considering one of the open sets (say $U_j$) which contains the image of $\phi$, taking the composition $\phi_j := \xi_j\circ\phi$ and applying Lemma 11.10 to $(\alpha,\phi_j)$.

11.7. Proof of Lemma 11.10. 

11.7.1. Proof of (1). We proceed by contradiction. Assume that there exist sequences of real numbers $\epsilon_u > 0$, almost complex structures $I_u$, Riemannian metrics $g_u$ on $\mathbb{C}^n$ and $I_u$-holomorphic pairs $(\alpha_u,\phi_u)$ with $\epsilon_u \to 0$,

$$\|I_u - I_0\|_{L^\infty} < \epsilon_u, \quad \|DI_u\|_{L^\infty} < K, \quad \|D^2I_u\|_{L^\infty} < K, \tag{11.86}$$

$$\|g_u - g_0\|_{L^\infty} < \epsilon_u, \quad \|Dg_u\|_{L^\infty} < K, \tag{11.87}$$

$$\|\alpha_u\|_{L^\infty} < \epsilon_u, \quad \|d_{\alpha_u}\phi_u\|_{L^\infty} < \epsilon_u, \tag{11.88}$$

and such that defining $\psi_u$ (resp. $\phi_{u,0}$) as $\psi$ (resp. $\phi_0$) in Theorem 11.1 we have the estimate

$$\|d\alpha_u\|_{L^2} < \epsilon_u\|\phi_{u,0}\|_{L^2}, \tag{11.89}$$

and the inequality opposite to (11.84) is satisfied:

$$\|\phi_{u,0}\|^2_{L^2(Z_{II})} > \frac{1}{e+e^{-1}}\left(\|\phi_{u,0}\|^2_{L^2(Z_I)} + \|\phi_{u,0}\|^2_{L^2(Z_{III})}\right). \tag{11.90}$$

Define

$$\phi_{u,av}(t,\theta) := \phi_u(t,\theta) - \frac{1}{2\pi}\int\phi_u(t,\nu)\,d\nu.$$

It follows from (11.88) that $\|\phi_{u,0}\|_{L^\infty} \to 0$ (unless we specify something different, all limits in this proof will implicitly mean as $u$ goes to $\infty$). Combining this fact with Lemma 11.8 and (11.90) we deduce that for big enough $u$ we also have

$$\|\phi_{u,av}\|^2_{L^2(Z_{II})} > \frac{1}{e^{1.5}+e^{-1.5}}\left(\|\phi_{u,av}\|^2_{L^2(Z_I)} + \|\phi_{u,av}\|^2_{L^2(Z_{III})}\right). \tag{11.91}$$

Lemma 11.8 combined with (11.89) also implies that

$$\|d\alpha_u\|_{L^2}/\|\phi_{u,av}\|_{L^2} \to 0. \tag{11.92}$$

Let

$$x_u := \frac{1}{2\pi}\int\phi_u(0,\nu)\,d\nu.$$

Using a real linear transformation $\mathbb{C}^n \to \mathbb{C}^n$ we may assume that $I_u(x_u) = I_0$ and $g_u(x_u) = g_0$. Furthermore, since $\|I_u - I_0\|_{L^\infty} \to 0$, these linear transformations are uniformly bounded and (maybe after increasing $K$ slightly) they preserve the bounds on $DI$, $D^2I$ and $Dg$.

Using a gauge transformation we can assume that $\alpha_u$ is in balanced temporal gauge, so that $\alpha_u = a_u\,d\theta$ for some function $a_u : C \to i\mathbb{R}$ and the restriction of $a_u$ to $\{0\}\times S^1$ is equal to a constant $\lambda_u \in i\mathbb{R}$. Then we have $\alpha_u = \lambda_u\,d\theta + \beta_u\,d\theta$, where

$$|\lambda_u| \le \|\alpha_u\|_{L^\infty} \quad\text{and}\quad \|\beta_u\|_{L^\infty} \le 2\|d\alpha_u\|_{L^\infty}. \tag{11.93}$$

Let us define $\xi_u(t,\theta) := \phi_u(t,\theta) + i\lambda_utI_0X - x_u$ and

$$\xi_{u,av}(t,\theta) := \xi_u(t,\theta) - \frac{1}{2\pi}\int\xi_u(t,\nu)\,d\nu. \tag{11.94}$$

Then we have $\xi_{u,av} = \phi_{u,av}$. It follows that $\|d\xi_u\|_{L^2} \ne 0$, for otherwise we would have $\xi_{u,av} = \phi_{u,av} = 0$, in contradiction with (11.91).

Lemma 11.11. The following holds: $\|\bar\partial_{I_0}\xi_u\|_{L^2}/\|d\xi_u\|_{L^2} \to 0$.

Proof. Applying (11.82) to the pair $(\alpha_u,\phi_u)$ we compute:

$$\bar\partial_{I_0}\xi_u = \frac{\partial\xi_u}{\partial t} - I_0\frac{\partial\xi_u}{\partial\theta} = (I_u(\phi_u) - I_0)\left(\frac{\partial\phi_u}{\partial\theta} - i\lambda_uX\right) - i\beta_uI_u(\phi_u)X = (I_u(\xi_u + x_u) - I_u(x_u))\left(\frac{\partial\xi_u}{\partial\theta} - i\lambda_uX\right) - i\beta_uI_u(\phi_u)X, \tag{11.95}$$

where in the last equality we have used the fact that $I_u$ is invariant under translations along $X$. We claim that for some constant $K'$ depending on $K$

$$\|I_u(\xi_u + x_u) - I_u(x_u)\|_{L^2} \le K'\|d\xi_u\|_{L^2}. \tag{11.96}$$

Indeed, using (11.86) we deduce that for any $x \in \mathbb{C}^n$ at distance less than $1$ from $x_u$ we have, for some $K'$ depending on $K$, $\|I_u(x) - I_u(x_u) - DI_u(x_u)(x - x_u)\| \le K'\|x - x_u\|^2$. Applying this to $x = \xi_u + x_u$ and using the fact that $\|d\xi_u\|_{L^\infty}$ is uniformly bounded (which follows from (11.88) and implies that $\xi_u(C)$ stays uniformly not too far from $x_u$) we get (11.96). The estimates (11.88) imply that $\|d_{\alpha_u}\phi_u\|_{L^\infty} \to 0$, which in turn implies

$$\left\|\frac{\partial\phi_u}{\partial\theta}\right\|_{L^\infty} \to 0. \tag{11.97}$$

Also, combining (11.88) with (11.93) we deduce that

$$\lambda_u \to 0. \tag{11.98}$$

Finally, combining (11.92) with $\|\phi_{u,av}\|_{L^2} = \|\xi_{u,av}\|_{L^2} \le K'\|d\xi_u\|_{L^2}$ we deduce

$$\|\beta_u\|_{L^2}/\|d\xi_u\|_{L^2} \to 0. \tag{11.99}$$

Combining (11.95) with (11.96)–(11.99), we obtain the desired limit. □

Now define $\xi'_u := \xi_u/\|d\xi_u\|_{L^2}$. Since $\int\xi'_u(0,\nu)\,d\nu = 0$ and $\|d\xi'_u\|_{L^2} = 1$, the $L^2_1$ norm of $\xi'_u$ is uniformly bounded above, so there is a constant $K_0$ such that

$$1 \le \|\xi'_u\|_{L^2_1} \le K_0.$$

Since the inclusion $L^2_1 \subset L^2$ is compact, it follows that, up to passing to a subsequence, there is some nonzero $\xi \in L^2(C,\mathbb{C}^n)$ such that $\xi'_u \to \xi$ in $L^2$. We claim that $\xi$ is in fact holomorphic in the interior of $C$. Indeed, for any test function $g$ supported in the interior of $C$ we have $\langle\xi,\bar\partial^*g\rangle_{L^2} = \lim\langle\xi'_u,\bar\partial^*g\rangle_{L^2} = \lim\langle\bar\partial\xi'_u,g\rangle_{L^2} = 0$. Hence $\xi$ is a weak solution of $\bar\partial\xi = 0$, and from standard regularity results we deduce that $\xi$ is smooth and a strong solution: $\bar\partial\xi = 0$. Combining this with Gårding's inequality (in the interior of $C$): $\|\xi'_u - \xi\|_{L^2_1} \le K\left(\|\bar\partial(\xi'_u - \xi)\|_{L^2} + \|\xi'_u - \xi\|_{L^2}\right)$ we deduce that $\|\xi'_u - \xi\|_{L^2_1} \to 0$, so $\xi'_u$ converges to $\xi$ in $L^2_1$ and hence

$$\|d\xi\|_{L^2} = 1. \tag{11.100}$$

Define $\xi'_{u,av}$ and $\xi_{av}$ exactly as we defined $\xi_{u,av}$ in (11.94). Since $\xi'_u \to \xi$ in $L^2$, it follows that $\xi'_{u,av} \to \xi_{av}$ in $L^2$. On the other hand, since $\xi'_{u,av}$ is a rescaling of $\xi_{u,av}$ and we have $\xi_{u,av} = \phi_{u,av}$, the formula (11.91) implies, passing to the limit, that

$$\|\xi_{av}\|^2_{L^2(Z_{II})} \ge \frac{1}{e^{1.5}+e^{-1.5}}\left(\|\xi_{av}\|^2_{L^2(Z_I)} + \|\xi_{av}\|^2_{L^2(Z_{III})}\right). \tag{11.101}$$

This contradicts (2) in Lemma 10.10 (applied to $\phi = \xi$) unless $\xi$ is constant, which is impossible by (11.100). This finishes the proof of (1).

11.7.2. Proof of (2). We follow the same scheme of proof as before, assuming the existence of sequences $I_u$, $g_u$, $I_u$-holomorphic pairs $(\alpha_u,\phi_u)$, and $\epsilon_u \to 0$ satisfying (11.86)–(11.88) and

$$\|d\alpha_u\|_{L^2_1} < K, \quad \|d\alpha_u\|_{L^\infty} < K, \tag{11.102}$$

but contradicting (11.85). We will see that this is not possible. Define $\xi_u$ and $\xi_{u,av}$ as before. We state two lemmas whose proofs are postponed so as not to break the argument. The proof of Lemma 11.12 will be given in Section 11.8 and that of Lemma 11.13 in Section 11.9.

Lemma 11.12. For big enough $u$ we have $\|\xi_u\|_{L^2} \le 2\|\xi_{u,av}\|_{L^2} = 2\|\phi_{u,av}\|_{L^2}$.

Lemma 11.13. There is some constant $K_0$ such that, for big enough $u$, $\|\xi_u\|_{L^2_2} \le K_0$.

Let $\|\xi_u\|_{1/2}$ denote the Hölder $C^{0+\frac12}$ norm of $\xi_u$. Since in real dimension $2$ the Sobolev space $L^2_2$ is included in $C^{0+\frac12}$, Lemma 11.13 gives a uniform bound (here $K_0$ may increase from one line to the other, but will always be independent of $u$)

$$\|\xi_u\|_{1/2} \le K_0.$$

Taking this into account and applying Lemma 11.14 to $\xi_u$ we obtain for big enough $u$

$$\sup_Z|\xi_u| \le K_0\left(\|\xi_u\|^{1/4}_{L^2} + \|\xi_u\|_{L^2}\right) \le K_0\|\xi_u\|^{1/4}_{L^2}, \tag{11.103}$$

(recall that $\|\xi_u\|_{L^2}$ goes to $0$). It is easy to check that $\sup_Z|\xi_{u,av}| \le K_0\sup_Z|\xi_u|$. Combining this observation with the equality $\xi_{u,av} = \phi_{u,av}$ and (11.79) in Lemma 11.8 we deduce

$$\sup_Z|\phi_{u,0}| \le K_0\sup_Z|\xi_u|. \tag{11.104}$$

On the other hand, combining Lemma 11.12 with (11.78) in Lemma 11.8 we conclude

$$\|\xi_u\|_{L^2} \le K_0\|\phi_{u,0}\|_{L^2}. \tag{11.105}$$

Putting together (11.103)–(11.105) we deduce $\sup_Z|\phi_{u,0}| \le K_0\|\phi_{u,0}\|^{1/4}_{L^2}$, so (11.85) must hold, in contradiction with our assumption. This finishes the proof of (2).

Lemma 11.14. There is a constant $K > 0$ with the following property. Let $f : C \to \mathbb{C}^n$ be a smooth map and let $\|f\|_{1/2}$ denote its Hölder $C^{0+\frac12}$ norm. Then

sup|/|<k (||/||K 4 ||/||^ + ||/||l 2 )- (H-106) 

z 1 

Proof. Denote for convenience $H := \|f\|_{1/2}$, and let $x \in Z$ be such that $|f(x)| = \sup_Z|f|$. Let

$$\rho := \frac{|f(x)|^2}{4H^2}.$$

Then for any $y \in C$ such that $|x - y| \le \rho$ we have $|f(y)| \ge |f(x)|/2$, because

$$\frac{|f(x) - f(y)|}{|x - y|^{1/2}} \le H \implies |f(y)| \ge |f(x)| - H|x - y|^{1/2} \ge |f(x)| - \frac{|f(x)|}{2}.$$
Now suppose that p < 1, so that the ball B of radius p centered at x is contained in C. 
Then \\f\\L 2 > ||/||l 2 (b) > -^oP 2 1/(^)1) which implies, rearranging, that 

\f{x)\<K Q \\f\\^\\f\\% (11.107) 

On the other hand, if $\rho \ge 1$ we estimate $|f(x)| \le K_0\|f\|_{L^2}$. Summing the two inequalities we obtain (11.106). □

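11.8. Proof of Lemma 11.12. Pick some big $u$ and let $f := \xi_u$ and $f_{av} := \xi_{u,av}$. By definition we have

$$|f(t,\theta) - f_{av}(t,\theta)| = \left|\frac{1}{2\pi}\int f(t,\nu)\,d\nu\right| = \left|\frac{1}{2\pi}\int_0^t\!\!\int\frac{\partial f}{\partial t}(\tau,\nu)\,d\nu\,d\tau\right|, \tag{11.108}$$

since $\int f(0,\nu)\,d\nu = 0$. On the other hand, denoting $\bar\partial_{I_0}$ by $\bar\partial$ we have

$$\frac{\partial f}{\partial t}(t,\nu) = I_0\frac{\partial f}{\partial\theta}(t,\nu) + \bar\partial f(t,\nu).$$

Integrating over $\theta \in S^1$, the first term in the right hand side vanishes, so we obtain

$$\left|\int_0^t\!\!\int\frac{\partial f}{\partial t}(\tau,\nu)\,d\nu\,d\tau\right| = \left|\int_0^t\!\!\int\bar\partial f(\tau,\nu)\,d\nu\,d\tau\right| \le K\|\bar\partial f\|_{L^2}, \tag{11.109}$$

for some constant $K$. Let $K'$ be the constant in Gårding's inequality

$$\|df\|_{L^2} \le \|f\|_{L^2_1} \le K'\left(\|\bar\partial f\|_{L^2} + \|f\|_{L^2}\right).$$

Using Lemma 11.11 we know that, if $u$ is big enough, $\|\bar\partial f\|_{L^2} \le \frac{1}{2K'}\|df\|_{L^2}$. Rearranging the terms in the inequality, this implies that $\|df\|_{L^2} \le 2K'\|f\|_{L^2}$. Using again Lemma 11.11 we conclude that for big enough $u$ we have $\|\bar\partial f\|_{L^2} \le (2K\,\mathrm{Vol}(C))^{-1}\|f\|_{L^2}$. Taking $L^2(C)$ norms in (11.108) and combining the previous inequality with (11.109) we obtain

$$\|f\|_{L^2} \le \|f_{av}\|_{L^2} + \frac12\|f\|_{L^2}.$$

Hence, $\|f\|_{L^2} \le 2\|f_{av}\|_{L^2}$, which is what we wanted to prove.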

11.9. Proof of Lemma 11.13. As before, throughout this proof $K_0$ will denote a positive number which may increase from line to line but which will always be independent of $u$. We estimate the $L^2_1$ norm of $\bar\partial_{I_0}\xi_u$ using formula (11.95). First, since $DI_u$ is uniformly bounded and the diameter of $\xi_u(C)$ tends to $0$ (because of (11.88)), we have $\|I_u(\xi_u + x_u) - I_u(x_u)\|_{L^\infty} \to 0$. Hence, for any $\delta > 0$ and big enough $u$ we have

$$\left\|(I_u(\xi_u + x_u) - I_u(x_u))\frac{\partial\xi_u}{\partial\theta}\right\|_{L^2_1} \le \|I_u(\xi_u + x_u) - I_u(x_u)\|_{L^\infty}\|\xi_u\|_{L^2_2} + \|I_u(\xi_u + x_u) - I_u(x_u)\|_{L^2_1}\|d\xi_u\|_{L^\infty} \le \delta\|\xi_u\|_{L^2_2} + \|I_u(\xi_u + x_u) - I_u(x_u)\|_{L^2_1}\|d\xi_u\|_{L^\infty}.$$

(Note by the way that $\partial\xi_u/\partial\theta = \partial\phi_u/\partial\theta$.) Now we can estimate

$$\|I_u(\xi_u + x_u) - I_u(x_u)\|_{L^2_1} \le K_0\|\xi_u\|_{L^2_1}.$$
We also know that if u is big enough ||£«||l2 < 2||£ M || i 2 (this follows from Lemma 11.11 
and Garding's inequality, see for example the argument in Section 11.8). Furthermore, 
||£u||l 2 g° es to 0. Also, since f £ u (Q,v)dv = and the diameter of £„(C) goes to 0, 
< Kq holds if u is big enough. Putting these observations together we obtain 

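$$\left\|(I_u(\xi_u + x_u) - I_u(x_u))\frac{\partial\xi_u}{\partial\theta}\right\|_{L^2_1} \le \delta\|\xi_u\|_{L^2_2} + K_0. \tag{11.110}$$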

For the next term in (11.95) the following is easy to prove (if $u$ is big enough), taking into account the previous arguments:

$$\|(I_u(\xi_u + x_u) - I_u(x_u))\,i\lambda_uX\|_{L^2_1} \le K_0\|\xi_u\|_{L^2_1} \le K_0. \tag{11.111}$$

The remaining term satisfies, provided $u$ is big enough,

$$\|\beta_uI_u(\phi_u)X\|_{L^2_1} \le K_0\left(\|\beta_u\|_{L^2_1} + \|\beta_u\|_{L^\infty}\|\xi_u\|_{L^2_1}\right) \le K_0\left(\|d\alpha_u\|_{L^2_1} + K\|d\alpha_u\|_{L^\infty}\|\xi_u\|_{L^2_1}\right) \le K_0\left(\|d\alpha_u\|_{L^2_1} + \|\xi_u\|_{L^2_1}\right) \le K_0, \tag{11.112}$$

where here we use (11.102) and the inequality $\|\beta_u\|_{L^2_1} \le K_0\|d\alpha_u\|_{L^2_1}$, which is easy to prove. Taking this into account and combining (11.110), (11.111) and (11.112), formula (11.95) implies that $\|\bar\partial_{I_0}\xi_u\|_{L^2_1} \le \delta\|\xi_u\|_{L^2_2} + K_0$. Finally, using the standard inequality

$$\|\xi_u\|_{L^2_2} \le K'\left(\|\bar\partial_{I_0}\xi_u\|_{L^2_1} + \|\xi_u\|_{L^2_1}\right)$$

and taking $\delta$ smaller than $1/(2K')$ we obtain the desired bound.



12. Limits of approximate gradient lines

Recall that $H = -i\mu$. We denote for convenience $V := IX$, so that $V$ is the negative gradient of $H$.

Theorem 12.1. Let $\sigma > 0$ be a real number. Suppose that $\{\psi_u : T_u \to X,\ l_u,\ G_u\}$ is a sequence of triples in which each $\psi_u$ is a smooth map with domain a finite closed interval $T_u \subset \mathbb{R}$, each $l_u$ is a nonzero real number and each $G_u > 0$ is a real number. Suppose that for each $u$ and $t \in T_u$ we have

$$|\psi'_u(t) - l_uV(\psi_u(t))| \le G_ue^{-\sigma d(t,\partial T_u)}. \tag{12.113}$$

Suppose also that $G_u \to 0$ and that $l_u \to 0$. Passing to a subsequence, we can assume that $l_u|T_u|$ converges somewhere in $\mathbb{R}\cup\{\pm\infty\}$ (here $|T_u|$ denotes the length of $T_u$). Then we have the following.

(1) If $\lim l_u|T_u| = 0$ then $\lim\operatorname{diam}\psi_u(T_u) = 0$.

(2) If $\lim l_u|T_u| \ne 0$, define for big enough $u$ and for every $t \in S_u$ the rescaled objects $S_u := l_uT_u$ and $f_u(t) := \psi_u(t/l_u)$. There is a subsequence of $\{f_u, S_u\}$ which converges to a chain of gradient segments $\mathcal{T}$ in $X$.

Proof. The case $l_u|T_u| \to 0$ is obvious, so we consider the case $\lim l_u|T_u| > 0$. We begin by modifying slightly the definition of $S_u$. Taking $u$ big enough we can assume that $l_u < 1$. Suppose that $T_u = [a,b]$ and let $T'_u := [a,\ a - \ln l_u/\sigma]$ and $T''_u := [b + \ln l_u/\sigma,\ b]$. Using (12.113) we can bound

$$\operatorname{diam}(\psi_u(T'_u)) \le \int_{T'_u}l_u|V|\,dt + \int_0^{-\ln l_u/\sigma}G_ue^{-\sigma x}\,dx \le l_u(\sup|V|)\,|\ln l_u|/\sigma + G_u(1 - l_u)/\sigma \to 0,$$

and similarly $\operatorname{diam}(\psi_u(T''_u)) \to 0$. Consequently, if we define

$$S_u := \left[l_u(a - \ln l_u/\sigma),\ l_u(b + \ln l_u/\sigma)\right] \quad\text{and}\quad f_u(t) := \psi_u(t/l_u), \tag{12.114}$$

then the statement of the theorem is equivalent to saying that the sequence $(f_u, S_u)$ defined by (12.114) has a subsequence converging to a chain of gradient segments $\mathcal{T}$. Furthermore, equation (12.113) implies that for any $t \in S_u$ we have

$$|f'_u(t) - V(f_u(t))| \le G_ue^{-\sigma d(t,\partial S_u)/l_u} \le G_ue^{-\sigma d(t,\partial S_u)}. \tag{12.115}$$
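
The bound on the diameter of the trimmed ends $\psi_u(T'_u)$ rests on the closed form $\int_0^{-\ln l/\sigma}Ge^{-\sigma x}\,dx = G(1-l)/\sigma$ and on the fact that $l|\ln l| \to 0$. The following sketch checks both numerically; the function names, the midpoint rule and the sample sequences $l_u = G_u = 10^{-k}$ are illustrative assumptions only.

```python
import math

def trimmed_end_bound(l, G, sigma, sup_V):
    """Upper bound for diam(psi_u(T'_u)), for 0 < l < 1."""
    length = -math.log(l) / sigma        # length of the trimmed end T'_u
    drift = l * sup_V * length           # bounds the integral of l |V| dt
    error = G * (1.0 - l) / sigma        # closed form of the error integral
    return drift + error

def error_integral_numeric(l, G, sigma, steps=20000):
    """Midpoint-rule value of the integral of G e^{-sigma x} on [0, -ln(l)/sigma]."""
    length = -math.log(l) / sigma
    h = length / steps
    return sum(G * math.exp(-sigma * (i + 0.5) * h) for i in range(steps)) * h
```

Along the sample sequence the values of `trimmed_end_bound` decrease to zero, in agreement with the limit stated in the display preceding (12.114).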

Lemma 12.2. For any connected component $F' \subset F$ of the fixed point set and any small enough $\delta > 0$ there exist numbers $0 < d < \delta$, $\eta > 0$ and $c > 0$ with the following property: let $f : [a,b] \to X$ be a map satisfying $|f'(t) - V(f(t))| < \eta$ for every $t$ and, for some $a \le t < t' \le b$,

$$f(t) \in F'^{\,d} \quad\text{and}\quad f(t') \notin F'^{\,2\delta}$$

(recall that $F'^{\,d}$ and $F'^{\,2\delta}$ denote the $d$ and $2\delta$-neighbourhoods of $F'$ respectively). Then, for every $t'' \ge t' + \delta$ we have

$$H(f(t'')) \le H(F') - c.$$

In particular, if $d$ is small enough then for every $t'' \ge t' + \delta$ we have $f(t'') \notin F'^{\,d}$ (i.e., $f$ never comes back to $F'^{\,d}$ after time $t' + \delta$).

The proof of Lemma 12.2 will be given in Section 12.1. Now we continue with the proof of Theorem 12.1. Passing to a subsequence, we can assume that there is some $K > 0$ such that for any connected component $F' \subset F$ either $\lim_{u\to\infty}d(\psi_u(S_u),F') = 0$ or $d(\psi_u(S_u),F') \ge K$ for every $u$. Let $F_1,\dots,F_l \subset F$ be the connected components which fall in the first case. We claim that the values of $H$ in each of the components $F_1,\dots,F_l$ are all different.

To prove this, suppose on the contrary that for some $i$ and $j$ we have $H_0 = H(F_i) = H(F_j)$. Take some small $\delta$, and let $0 < d < \delta$, $\eta$ and $c$ be the numbers obtained by taking $F' := F_i$ in Lemma 12.2. Let also $a > 0$ be so small that $H(F_j^a) \subset [H_0 - c/2, H_0 + c/2]$. Then, if $G_u$ is smaller than $\eta$, and for some $t$ we have $f_u(t) \in F_i^d$, then for every $t' > t$ we have $f_u(t') \notin F_j^a$. Interchanging the roles of $F_i$ and $F_j$ we prove in the same way that there are some $d', a'$ such that if $f_u(t) \in F_j^{d'}$ and $t' > t$ then $f_u(t') \notin F_i^{a'}$. So we cannot have simultaneously $\lim_{u\to\infty}d(\psi_u(S_u),F_i) = 0$ and $\lim_{u\to\infty}d(\psi_u(S_u),F_j) = 0$.

Hence we can suppose that $H(F_1) > H(F_2) > \dots > H(F_l)$. Now we pick some
very small $\delta$. Suppose that $u$ is so big that we can apply Lemma 12.2 to each of the
components $F_1, \dots, F_l$. Let $d_1, \dots, d_l$ be the numbers given by Lemma 12.2 for our choice
of $\delta$. Then we define, for every $1 \le j \le l$,

$$E^\delta_{u,j} := [\inf\{t \in S_u \mid f(t) \in F_j^{d_j}\},\ \inf\{t \in S_u \mid f(t) \notin F_j^{2\delta}\} + \delta].$$

Let $E^\delta_u$ be the union $E^\delta_{u,1} \cup \dots \cup E^\delta_{u,l}$ and let $T^\delta_u$ denote the closure of the complement
$S_u \setminus E^\delta_u$.

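Passing to a subsequence we can assume that for all $\delta$ and $u$ all sets $T^\delta_u$ have the same
number of connected components: $T^\delta_u = T^\delta_{u,1} \cup \dots \cup T^\delta_{u,p}$, where $p$ lies between $l - 1$ and
$l + 1$, and that the inequality $T^\delta_{u,1} < E^\delta_{u,1}$ holds either for all $u$ or for none of them; if the
inequality holds and furthermore

$$\limsup_{\delta\to 0}\ \limsup_{u\to\infty}\ \operatorname{diam}(f_u(T^\delta_{u,1})) = 0,$$

then we remove $T^\delta_{u,1}$ from $T^\delta_u$ and attach it to $E^\delta_{u,1}$. Similarly, we remove $T^\delta_{u,p}$ from $T^\delta_u$ if
the diameter of its images converges to 0. After these operations we end up with a new
set $T^\delta_u = T^\delta_{u,1} \cup \dots \cup T^\delta_{u,k}$ for each $u$ and $\delta$.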

It follows from the definition of $E^\delta_{u,j}$ and from Lemma 12.2 that for any $t \in T^\delta_u$
the point $f(t)$ lies away from the set $X' = F_1^{d_1} \cup \dots \cup F_l^{d_l}$. Since $\inf_{X \setminus X'} |V| > 0$, it
follows that if $G_u$ is small enough then the size of $f'(t)$ is comparable to that of $V$. As
a consequence, for fixed $\delta$ the intervals $T^\delta_{u,j}$ have bounded length. From this it is rather
straightforward to prove, passing to a subsequence, that (6.21) holds and that there is a
limiting gradient segment $(x_j, T_j)$ to which the images of $T^\delta_{u,j}$ converge.

Hence we only need to prove that the images of the intervals $E^\delta_{u,j}$ accumulate near
the fixed point component $F_j$ and that their diameters tend to 0. The former is almost
obvious from the definition, whereas the latter is a bit more subtle and follows from the next
lemma, using the fact that $f_u(E^\delta_{u,j}) \subset F_j^{2\delta}$. □

Lemma 12.3. There is a constant $K > 0$ with the following property. For any small
enough $\delta > 0$, any big enough $u$ (depending on $\delta$), and any interval $E \subset S_u$ such that
$f_u(E) \subset F^\delta$, we have $\operatorname{diam}(f_u(E)) \le \delta K$.

Proof. If $\delta$ is small enough we can assume that $f_u(E)$ lies in the $\delta$-neighbourhood $F_0^\delta$ of
a unique connected component $F_0 \subset F$. Let $N$ be the normal bundle of $F_0$. Using the
exponential map with respect to the $S^1$-equivariant metric $g$, we can identify $F_0^\delta$ with
a neighbourhood $N^\delta$ of the zero section of $N$. Then we can pull back the vector field $V$
to a vector field on $N^\delta$, which we denote by the same symbol $V$. We also think of $f_u|_E$
as taking values in $N^\delta$. Consider on $N$ the restriction of the metric $g$. This gives an
equivariant Euclidean metric on $N$.

The bundle $N$ carries a linear action of $S^1$, and we can split $N = N_1 \oplus \dots \oplus N_k$ in
such a way that $S^1$ acts on $N_j$ with weight $w_j \ne 0$. Let $V_0 \in \Gamma(TN^{\mathrm{vert}})$ be the vertical
tangent field whose value at a vector $x = (x_1, \dots, x_k) \in N$ is $V_0(x) := (w_1 x_1, \dots, w_k x_k)$
(here we are identifying $TN^{\mathrm{vert}} \simeq N$). It is well known that $V_0$ approximates $V$ to first
order near $F_0$ (which we view as the zero section of $N$). More precisely, there is a
constant $K$ such that for any $x \in N^\delta$ we have

$$|V_0(x) - V(x)| \le K|x|^2. \tag{12.116}$$

Let $d : E \to \mathbb{R}$ be the function defined as $d(t) := |f_u(t)|^2$. We want to prove that $d$
decays exponentially as $t$ goes away from the extremes of $E$. We will follow the same
idea as in the proof of formula (11.60) in Theorem 11.1; however, in this situation the
analysis will be much simpler.

Shifting $E$ (and modifying $f_u$ accordingly) and removing if necessary a small interval of
length $< 2$ at the end of $E$, we can assume that $E = [-L, L]$, where $L$ is a natural number
(it is clear that the truncation does not affect the estimate). Define for every integer
$-L \le n < L$ the energy $d_n := \int_n^{n+1} |f_u(t)|\,dt$. Define also $g_n := \int_n^{n+1} |f_u'(t) - V(f_u(t))|\,dt$.

Lemma 12.4. There exist constants $\gamma \in (1/(e + e^{-1}), 1/2)$ and $\epsilon > 0$ such that, for every
$-L < n < L - 1$ satisfying $g_{n-1} + g_n + g_{n+1} \le \epsilon d_n$, we have

$$d_n \le \gamma(d_{n-1} + d_{n+1}).$$

The proof of Lemma 12.4 will be given in Section 12.2. We now finish the proof of
Lemma 12.3. Inequality (12.115) implies that $g_n \le G_u e^{-\sigma(L - |n|)}$. Arguing exactly as in the
proof of Lemma 10.9, we deduce that for some $\beta > 0$ independent of $f_u$ there is a bound
$d_n \le (\epsilon^{-1} K G_u + d_{-L} + d_L) e^{-\beta(L - |n|)}$. Then we have (using the fact that $|V(x)| \le K|x|$)

$$\operatorname{diam}(f_u(E)) \le \sum_n (g_n + K d_n) \le K(G_u + d_{-L} + d_L) \le K(G_u + \delta).$$

Taking $u$ big enough so that $G_u < \delta$, the result follows. □

12.1. Proof of Lemma 12.2. We can assume without loss of generality that $\eta < 1$.
Define $M := \sup|V| + 1$. Then for every $t$ we have

$$|f'(t)| \le M. \tag{12.117}$$

Let $h := H(f)$ and let $h_0 := H(F_0)$. We have $h' = -\langle V(f), f' \rangle$, so that

$$\bigl|h'(t) + |V(f(t))|^2\bigr| \le \eta|V(f(t))|. \tag{12.118}$$

This implies the following: for every $\delta > 0$ there exists some $K > 0$ such that, if
$|\eta| < K/2$ and $f(t) \in X^\delta$ then $h'(t) < -K/2$.

Lemma 12.5. Given any small $\epsilon > 0$, if $\eta$ is small enough (depending on $\epsilon$) we have,
for any $t$,

(1) if $f(t) \in X^{\epsilon M + \delta}$ and $h(t) < h_0 + \epsilon K/4$ then, for every $t' \ge t + \epsilon$, we have
$h(t') < h_0 - \epsilon K/4$;

(2) if $h(t) < h_0 - \epsilon K/4$, then for any $t' \ge t$ we have $h(t') < h_0 - \epsilon K/4$;

(3) if $h(t) < h_0 + \epsilon K/4$, then for any $t' \ge t$ we have $h(t') < h_0 + \epsilon K/4$.

Proof. Suppose that $\eta$ is so small that whenever $f(t) \in X^\delta$ we have $h'(t) < -K/2$. It
follows from (12.117) that if $f(t) \in X^{\epsilon M + \delta}$, then for every $\tau \in [t, t+\epsilon]$ we have $f(\tau) \in X^\delta$,
so $h'(\tau) < -K/2$. Integrating over $[t, t+\epsilon]$ we obtain $h(t+\epsilon) < h(t) - \epsilon K/2 < h_0 - \epsilon K/4$,
which together with (2) gives (1). We now prove (2). If $\epsilon$ is small enough then
$V$ does not vanish in $\{H = h_0 - \epsilon K/4\}$. Taking $\eta$ smaller than one half of the infimum
of $|V|$ on the level set $\{H = h_0 - \epsilon K/4\}$ we deduce from (12.118) that for any $\tau$ such
that $h(\tau) = h_0 - \epsilon K/4$ we have $h'(\tau) < 0$. This clearly implies (2). The same argument
proves (3). □

Take any $\epsilon$ satisfying the hypothesis of the previous lemma and also $\epsilon < M^{-1}\delta$. Take
$d < \delta$ small enough so that we have an inclusion

$$F_0^d \subset \{|H - h_0| < \epsilon K/4\}.$$

Define also $c := \epsilon K/4$. We claim that this choice of $d$ and $c$ satisfies the requirements
of the lemma. Indeed, suppose that for some $t < t'$ we have $f(t) \in F_0^d$ and $f(t') \notin F_0^{2\delta}$.
Then $h(t) < h_0 + \epsilon K/4$, so by (3) in Lemma 12.5 we also have $h(t') < h_0 + \epsilon K/4$. If $\delta$ is
small enough so that the $2\delta$-neighbourhoods of each connected component of $F$ are all
disjoint, then we can assume that $f(t') \in X^{2\delta} \subset X^{\epsilon M + \delta}$. Combining this fact with the
bound on $h(t')$ and (1) in Lemma 12.5 we deduce that $h(t' + \epsilon) < h_0 - \epsilon K/4$. Finally,
(2) in Lemma 12.5 implies that for every $t'' \ge t' + \epsilon$ we have $h(t'') < h_0 - \epsilon K/4$, which is
what we wanted to prove (recall that $M \ge 1$, so $\epsilon < M^{-1}\delta \le \delta$).
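
The statement of Lemma 12.2 can be illustrated on a toy model. The Python sketch below is not the setting of the paper: it takes $X = \mathbb{R}$ (which is not compact), $H(x) = -x^2/2$ and $V(x) = x$, so that the only zero of $V$ is $F_0 = \{0\}$ and $H$ decreases along the flow of $V$, and it integrates $f' = V(f) + p$ with a hypothetical perturbation $|p| \le \eta$ by the explicit Euler method. All names and numerical values are assumptions made for the illustration.

```python
import math

def H(x):
    """Toy function with dH/dx = -V(x) for V(x) = x."""
    return -0.5 * x * x

def perturbed_flow(x0, eta, t_end=10.0, dt=1e-3):
    """Explicit Euler steps for f' = V(f) + p(t), where V(x) = x and |p(t)| <= eta."""
    ts, xs = [0.0], [x0]
    for k in range(int(round(t_end / dt))):
        t, x = ts[-1], xs[-1]
        p = eta * math.sin(7.0 * t)      # hypothetical perturbation of size at most eta
        xs.append(x + dt * (x + p))
        ts.append((k + 1) * dt)
    return ts, xs

def max_H_after_exit(ts, xs, d, delta):
    """Return (t', max of H(f(t'')) over t'' >= t' + delta), where t' is the first time
    f lies outside the 2*delta-neighbourhood of 0 after having been in the d-neighbourhood.
    Returns None if no such time occurs."""
    entered = False
    for i, x in enumerate(xs):
        if abs(x) <= d:
            entered = True
        elif entered and abs(x) > 2.0 * delta:
            later = [H(y) for t, y in zip(ts, xs) if t >= ts[i] + delta]
            return (ts[i], max(later)) if later else None
    return None

# Hypothetical sample values: d < delta and eta much smaller than delta.
d, delta, eta = 0.01, 0.1, 0.001
result = max_H_after_exit(*perturbed_flow(0.005, eta), d, delta)
```

In this toy model the value of $H$ after the exit time stays below $H(F_0) - c$ with $c = 2\delta^2$, since $|f|$ keeps growing once it exceeds $2\delta > \eta$.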

12.2. Proof of Lemma 12.4. Suppose that $f_u|_E$ takes values on the restriction $N'$ of
$N$ to the connected component $F' \subset F$ of the fixed point set. Let $N' = N_1 \oplus \dots \oplus N_k$
be the decomposition in weights of the $S^1$ action, and denote the corresponding weights
by $w_1, \dots, w_k$. Let also $A$ denote the endomorphism of $N$ acting on the subbundle $N_j$
as multiplication by $w_j$.

Define, for every $t \in [-1,2]$, $\phi(t) := f_u(n + t)$, $G(t) := \phi'(t) - V(\phi(t))$ and $H(t) :=
V(\phi(t)) - V_0(\phi(t))$. Let $(x_1, \dots, x_k)$ be the coordinates of $\phi(0)$. The relation between $\phi$
and the integral curves of the linear vector field $V_0$ is given by Duhamel's formula:

$$\phi(t) = e^{tA}\phi(0) + \int_0^t e^{(t-s)A}(G(s) + H(s))\,ds = (e^{w_1 t}x_1, \dots, e^{w_k t}x_k) + \int_0^t e^{(t-s)A}(G(s) + H(s))\,ds. \tag{12.119}$$
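
Duhamel's formula can be sanity-checked on a single weight component, where $A$ acts as multiplication by a weight $w$ and the equation reads $\phi' = w\phi + r$ with $r$ playing the role of $G + H$. The Python sketch below is an illustration under these assumptions: the forcing term `forcing` and all numerical values are hypothetical. It compares a midpoint-rule evaluation of $e^{wt}x_0 + \int_0^t e^{(t-s)w} r(s)\,ds$ with a classical Runge-Kutta solution of the differential equation, for both signs of $t$.

```python
import math

def forcing(s):
    """Hypothetical stand-in for one component of G(s) + H(s)."""
    return 0.1 * math.cos(3.0 * s)

def duhamel(w, x0, t, steps=20000):
    """e^{w t} x0 + integral over [0, t] of e^{(t - s) w} forcing(s) ds (midpoint rule)."""
    h = t / steps
    integral = h * sum(
        math.exp((t - (k + 0.5) * h) * w) * forcing((k + 0.5) * h) for k in range(steps)
    )
    return math.exp(w * t) * x0 + integral

def rk4(w, x0, t, steps=20000):
    """Classical Runge-Kutta solution at time t of phi' = w*phi + forcing(s), phi(0) = x0."""
    h = t / steps
    x, s = x0, 0.0
    for _ in range(steps):
        k1 = w * x + forcing(s)
        k2 = w * (x + 0.5 * h * k1) + forcing(s + 0.5 * h)
        k3 = w * (x + 0.5 * h * k2) + forcing(s + 0.5 * h)
        k4 = w * (x + h * k3) + forcing(s + h)
        x += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        s += h
    return x

# Discrepancy between the two computations for a sample weight and time.
discrepancy = abs(duhamel(3.0, 1.0, 2.0) - rk4(3.0, 1.0, 2.0))
```

Both signs of $t$ are covered because the step `h = t / steps` is negative when $t < 0$, which matches the range $t \in [-1, 2]$ used in the proof.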

Let $\phi_0(t) := (e^{w_1 t}x_1, \dots, e^{w_k t}x_k)$ and let $\chi(t)$ denote the integral term in (12.119), so that $\phi = \phi_0 + \chi$. Define also
$d^0_j := \int_{j-n}^{j-n+1} |\phi_0(t)|\,dt$ for every $j \in \{n-1, n, n+1\}$. One checks easily (see for example
the proof of Lemma 10.11) that

$$d^0_n \le \frac{1}{e + e^{-1}}\,(d^0_{n-1} + d^0_{n+1}). \tag{12.120}$$
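
The convexity estimate for the linear flow can be tested numerically. The Python sketch below is an illustration under stated assumptions: the weights are nonzero integers (as for a circle action), the norm is Euclidean, the index $n$ is shifted to 0 so that the three intervals are $[-1,0]$, $[0,1]$ and $[1,2]$, and the sample weights and coordinates are hypothetical. It computes the three integrals of $|\phi_0|$ by the midpoint rule and returns the difference between the two sides of the inequality.

```python
import math

def norm_phi0(weights, x, t):
    """Euclidean norm of phi_0(t) = (exp(w_1 t) x_1, ..., exp(w_k t) x_k)."""
    return math.sqrt(sum((math.exp(w * t) * xi) ** 2 for w, xi in zip(weights, x)))

def unit_energy(weights, x, a, steps=2000):
    """Midpoint-rule approximation of the integral of |phi_0| over [a, a + 1]."""
    h = 1.0 / steps
    return h * sum(norm_phi0(weights, x, a + (k + 0.5) * h) for k in range(steps))

def convexity_gap(weights, x):
    """(d_prev + d_next)/(e + 1/e) - d_mid for the linear flow; the estimate says this is >= 0."""
    d_prev, d_mid, d_next = (unit_energy(weights, x, a) for a in (-1.0, 0.0, 1.0))
    return (d_prev + d_next) / (math.e + 1.0 / math.e) - d_mid

# A single weight equal to 1 is the borderline case: the two sides agree.
gap_borderline = convexity_gap([1], [0.7])

# Hypothetical mixed weights: the inequality holds with room to spare.
gap_mixed = convexity_gap([1, -2, 3], [0.3, -1.0, 0.5])
```

The borderline case explains the constant: for a single component of weight $\pm 1$ the two outer integrals are exactly $e^{-1}$ and $e$ times the middle one.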

By assumption we have

$$\|G(t)\|_{L^1(I)} \le (g_{n-1} + g_n + g_{n+1}) \le \epsilon d_n.$$

On the other hand, using (12.116) and $|\phi| \le K\delta$ we obtain

$$\|H(t)\|_{L^1(I)} \le K \int_I |\phi(t)|^2\,dt \le K\delta\,\|\phi\|_{L^1(I)}.$$

Since $e^{\pm(t-s)A}$ is bounded for $s, t \in [-1,2]$, we can bound

$$\|\chi\|_{L^1(I)} \le K(\|G(t)\|_{L^1(I)} + \|H(t)\|_{L^1(I)}) \le K(\epsilon d_n + \delta\|\phi\|_{L^1(I)}).$$

Taking this estimate for $I$ equal to $[n-1, n]$, $[n, n+1]$ or $[n+1, n+2]$ it follows from
(12.119) that, for any $j \in \{n-1, n, n+1\}$, the inequality $|d_j - d^0_j| \le K(\epsilon d_n + \delta d_j)$ holds
(here $d^0_j$ is the integral of $|\phi_0|$ appearing in (12.120)).
Combining this estimate with (12.120), and taking into account that $\gamma > 1/(e + e^{-1})$,
we deduce that if $\delta$ and $\epsilon$ are small enough then we must have

$$d_n \le \gamma(d_{n-1} + d_{n+1}),$$

which is what we wanted to prove.
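
The inequality $d_n \le \gamma(d_{n-1} + d_{n+1})$ with $\gamma < 1/2$ is what produces the exponential decay used in the proof of Lemma 12.3. The Python sketch below illustrates this in a simplified situation, assuming that the inequality holds at every interior index (the argument in the text must also handle the indices where the hypothesis on $g_n$ fails). In that case a comparison with the geometric sequences $\rho^{\pm n}$, where $\rho + 1/\rho = 1/\gamma$, gives $d_n \le (d_{-L} + d_L)\rho^{L - |n|}$. The function names and the sample sequence are hypothetical.

```python
import math

def decay_ratio(gamma):
    """Root rho in (0, 1) of gamma*rho**2 - rho + gamma = 0, so rho + 1/rho = 1/gamma."""
    return (1.0 - math.sqrt(1.0 - 4.0 * gamma * gamma)) / (2.0 * gamma)

def is_subsolution(d, gamma, tol=1e-12):
    """True if d[i] <= gamma*(d[i-1] + d[i+1]) at every interior index i."""
    return all(d[i] <= gamma * (d[i - 1] + d[i + 1]) + tol for i in range(1, len(d) - 1))

def has_exponential_decay(d, gamma, tol=1e-12):
    """True if d[i] <= (d[0] + d[-1]) * rho**k, with k the distance from i to the nearer end."""
    rho = decay_ratio(gamma)
    last = len(d) - 1
    return all(d[i] <= (d[0] + d[last]) * rho ** min(i, last - i) + tol for i in range(len(d)))

def sample_sequence(L, gamma, left=1.0, right=0.3):
    """A hypothetical sequence indexed by n = -L..L that satisfies the convexity inequality.

    It is the maximum of two geometric sequences, each of which satisfies the
    inequality, and the maximum of two such sequences satisfies it as well.
    """
    rho = decay_ratio(gamma)
    return [max(left * rho ** (n + L), right * (rho / 2.0) ** (L - n)) for n in range(-L, L + 1)]

gamma = 0.4                      # any value below 1/2 works; 0.4 gives rho = 0.5
d = sample_sequence(10, gamma)
```

A constant positive sequence fails the hypothesis for any $\gamma < 1/2$, which is consistent with the fact that it does not decay away from the ends.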
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